Overview
Dataset statistics
| Number of variables | 93 |
|---|---|
| Number of observations | 604720 |
| Missing cells | 35800281 |
| Missing cells (%) | 63.7% |
| Total size in memory | 429.1 MiB |
| Average record size in memory | 744.0 B |
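The derived figures in this table follow from the raw counts. A minimal sketch of that arithmetic, assuming the report divides missing cells by variables × observations and that 1 MiB is 1,048,576 bytes (the variable names are illustrative, not taken from the tool):

```python
# Figures copied from the dataset statistics table.
n_variables = 93
n_observations = 604_720
missing_cells = 35_800_281
total_size_mib = 429.1  # already rounded to one decimal in the report

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: about {avg_record_bytes:.0f} B")
```

Because the total size is itself rounded, the recomputed record size only approximates the reported 744.0 B.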
Variable types
| Text | 93 |
|---|---|
Dataset
| Description | Entomology NMNH Extant Extant Specimen Records 0052484-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.ptewed |
Alerts
institutionID has constant value "urn:lsid:biocol.org:col:34871" | Constant |
collectionID has constant value "urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad" | Constant |
institutionCode has constant value "USNM" | Constant |
collectionCode has constant value "ENT" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
organismID has constant value "70 21'9"W" | Constant |
eventType has constant value "-11.7815" | Constant |
waterBody has constant value "DeMarmels" | Constant |
verbatimDepth has constant value "220m inside cave entrance" | Constant |
locationRemarks has constant value "Garrison, Rosser W." | Constant |
verbatimSRS has constant value "Argia" | Constant |
footprintSpatialFit has constant value "Gynacantha membranalis" | Constant |
georeferencedBy has constant value "orichalcea" | Constant |
earliestEonOrLowestEonothem has constant value "Animalia, Arthropoda, Insecta, Odonata, Anisoptera, Aeshnidae" | Constant |
latestEonOrHighestEonothem has constant value "Animalia" | Constant |
earliestEraOrLowestErathem has constant value "Arthropoda" | Constant |
latestEraOrHighestErathem has constant value "Insecta" | Constant |
latestEpochOrHighestSeries has constant value "Pinellas" | Constant |
lowestBiostratigraphicZone has constant value "Gynacantha" | Constant |
formation has constant value "membranalis" | Constant |
identificationReferences has constant value "WGS 84 (EPSG:4326)" | Constant |
originalNameUsage has constant value "Google Earth" | Constant |
kingdom has constant value "Animalia" | Constant |
vernacularName has constant value "Type" | Constant |
catalogNumber has 233452 (38.6%) missing values | Missing |
recordNumber has 604683 (> 99.9%) missing values | Missing |
recordedBy has 203369 (33.6%) missing values | Missing |
sex has 339511 (56.1%) missing values | Missing |
lifeStage has 174155 (28.8%) missing values | Missing |
preparations has 42056 (7.0%) missing values | Missing |
associatedMedia has 390092 (64.5%) missing values | Missing |
occurrenceRemarks has 459346 (76.0%) missing values | Missing |
organismID has 604719 (> 99.9%) missing values | Missing |
eventType has 604719 (> 99.9%) missing values | Missing |
fieldNumber has 600468 (99.3%) missing values | Missing |
eventDate has 239420 (39.6%) missing values | Missing |
startDayOfYear has 244789 (40.5%) missing values | Missing |
endDayOfYear has 244303 (40.4%) missing values | Missing |
year has 239420 (39.6%) missing values | Missing |
month has 246636 (40.8%) missing values | Missing |
day has 270887 (44.8%) missing values | Missing |
verbatimEventDate has 396366 (65.5%) missing values | Missing |
habitat has 604521 (> 99.9%) missing values | Missing |
locationID has 603675 (99.8%) missing values | Missing |
higherGeography has 156093 (25.8%) missing values | Missing |
continent has 604592 (> 99.9%) missing values | Missing |
waterBody has 604719 (> 99.9%) missing values | Missing |
islandGroup has 602200 (99.6%) missing values | Missing |
island has 595353 (98.5%) missing values | Missing |
country has 156115 (25.8%) missing values | Missing |
stateProvince has 173239 (28.6%) missing values | Missing |
county has 254867 (42.1%) missing values | Missing |
locality has 158363 (26.2%) missing values | Missing |
minimumElevationInMeters has 558058 (92.3%) missing values | Missing |
maximumElevationInMeters has 573266 (94.8%) missing values | Missing |
verbatimElevation has 594785 (98.4%) missing values | Missing |
minimumDepthInMeters has 604685 (> 99.9%) missing values | Missing |
maximumDepthInMeters has 604709 (> 99.9%) missing values | Missing |
verbatimDepth has 604714 (> 99.9%) missing values | Missing |
locationRemarks has 604719 (> 99.9%) missing values | Missing |
decimalLatitude has 285696 (47.2%) missing values | Missing |
decimalLongitude has 285696 (47.2%) missing values | Missing |
geodeticDatum has 578337 (95.6%) missing values | Missing |
coordinateUncertaintyInMeters has 592766 (98.0%) missing values | Missing |
coordinatePrecision has 604717 (> 99.9%) missing values | Missing |
pointRadiusSpatialFit has 604718 (> 99.9%) missing values | Missing |
verbatimCoordinates has 604718 (> 99.9%) missing values | Missing |
verbatimLatitude has 523062 (86.5%) missing values | Missing |
verbatimLongitude has 523032 (86.5%) missing values | Missing |
verbatimCoordinateSystem has 604717 (> 99.9%) missing values | Missing |
verbatimSRS has 604719 (> 99.9%) missing values | Missing |
footprintSpatialFit has 604719 (> 99.9%) missing values | Missing |
georeferencedBy has 604719 (> 99.9%) missing values | Missing |
georeferenceProtocol has 366819 (60.7%) missing values | Missing |
georeferenceRemarks has 596270 (98.6%) missing values | Missing |
geologicalContextID has 604716 (> 99.9%) missing values | Missing |
earliestEonOrLowestEonothem has 604719 (> 99.9%) missing values | Missing |
latestEonOrHighestEonothem has 604719 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 604719 (> 99.9%) missing values | Missing |
latestEraOrHighestErathem has 604719 (> 99.9%) missing values | Missing |
earliestPeriodOrLowestSystem has 604716 (> 99.9%) missing values | Missing |
earliestEpochOrLowestSeries has 604717 (> 99.9%) missing values | Missing |
latestEpochOrHighestSeries has 604719 (> 99.9%) missing values | Missing |
latestAgeOrHighestStage has 604717 (> 99.9%) missing values | Missing |
lowestBiostratigraphicZone has 604719 (> 99.9%) missing values | Missing |
formation has 604719 (> 99.9%) missing values | Missing |
identificationQualifier has 603282 (99.8%) missing values | Missing |
typeStatus has 486142 (80.4%) missing values | Missing |
identifiedBy has 455024 (75.2%) missing values | Missing |
identifiedByID has 604718 (> 99.9%) missing values | Missing |
dateIdentified has 604718 (> 99.9%) missing values | Missing |
identificationReferences has 604719 (> 99.9%) missing values | Missing |
originalNameUsage has 604719 (> 99.9%) missing values | Missing |
kingdom has 6300 (1.0%) missing values | Missing |
subgenus has 512525 (84.8%) missing values | Missing |
specificEpithet has 8751 (1.4%) missing values | Missing |
infraspecificEpithet has 571231 (94.5%) missing values | Missing |
taxonRank has 571236 (94.5%) missing values | Missing |
scientificNameAuthorship has 90502 (15.0%) missing values | Missing |
vernacularName has 604718 (> 99.9%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-14 16:39:48.938174 |
|---|---|
| Analysis finished | 2025-01-14 16:40:13.576344 |
| Duration | 24.64 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 604720 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 604720 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1321729650 |
|---|---|
| 2nd row | 1320180785 |
| 3rd row | 4403931423 |
| 4th row | 1320185860 |
| 5th row | 1320185980 |
| Value | Count | Frequency (%) |
| 1321729650 | 1 | < 0.1% |
| 1321751610 | 1 | < 0.1% |
| 1828939237 | 1 | < 0.1% |
| 1321753851 | 1 | < 0.1% |
| 4403917418 | 1 | < 0.1% |
| 1321742115 | 1 | < 0.1% |
| 4403931423 | 1 | < 0.1% |
| 1320185860 | 1 | < 0.1% |
| 1320185980 | 1 | < 0.1% |
| 2236094411 | 1 | < 0.1% |
| Other values (604710) | 604710 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1132979 | |
| 3 | 860538 | |
| 2 | 781913 | |
| 0 | 530707 | |
| 8 | 513756 | |
| 9 | 488229 | |
| 7 | 474017 | |
| 4 | 451821 | 7.5% |
| 5 | 410705 | 6.8% |
| 6 | 402535 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6047200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1132979 | |
| 3 | 860538 | |
| 2 | 781913 | |
| 0 | 530707 | |
| 8 | 513756 | |
| 9 | 488229 | |
| 7 | 474017 | |
| 4 | 451821 | 7.5% |
| 5 | 410705 | 6.8% |
| 6 | 402535 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6047200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1132979 | |
| 3 | 860538 | |
| 2 | 781913 | |
| 0 | 530707 | |
| 8 | 513756 | |
| 9 | 488229 | |
| 7 | 474017 | |
| 4 | 451821 | 7.5% |
| 5 | 410705 | 6.8% |
| 6 | 402535 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6047200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1132979 | |
| 3 | 860538 | |
| 2 | 781913 | |
| 0 | 530707 | |
| 8 | 513756 | |
| 9 | 488229 | |
| 7 | 474017 | |
| 4 | 451821 | 7.5% |
| 5 | 410705 | 6.8% |
| 6 | 402535 | 6.7% |
modified
Text
| Distinct | 56593 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 30780 |
|---|---|
| Unique (%) | 5.1% |
Sample
| 1st row | 2013-09-16 11:56:00 |
|---|---|
| 2nd row | 2016-06-09 14:33:00 |
| 3rd row | 2023-08-23 09:36:00 |
| 4th row | 2023-05-19 10:32:00 |
| 5th row | 2015-10-05 15:58:00 |
| Value | Count | Frequency (%) |
| 2023-05-13 | 60773 | 5.0% |
| 2017-04-17 | 42518 | 3.5% |
| 2014-01-09 | 31212 | 2.6% |
| 2023-05-15 | 20528 | 1.7% |
| 2023-05-12 | 16800 | 1.4% |
| 2015-10-06 | 15979 | 1.3% |
| 2018-02-08 | 14193 | 1.2% |
| 2015-10-05 | 10265 | 0.8% |
| 2017-09-29 | 10242 | 0.8% |
| 11:48:00 | 10115 | 0.8% |
| Other values (3141) | 976815 |
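The value table above counts tokens rather than whole cell values: each `modified` timestamp contributes a date token and a time token, so the counts sum to 2 × 604720 = 1209440. A minimal sketch of that kind of count, assuming lowercasing plus whitespace splitting (the profiler's exact tokenizer may differ, and `token_counts` is a hypothetical helper name):

```python
from collections import Counter

# First five sample rows of the `modified` column, copied from the report.
sample_rows = [
    "2013-09-16 11:56:00",
    "2016-06-09 14:33:00",
    "2023-08-23 09:36:00",
    "2023-05-19 10:32:00",
    "2015-10-05 15:58:00",
]

def token_counts(values):
    """Count lowercased, whitespace-separated tokens across all values."""
    counts = Counter()
    for value in values:
        counts.update(value.lower().split())
    return counts

counts = token_counts(sample_rows)
total_tokens = sum(counts.values())

print(counts.most_common(3))
print(total_tokens)  # two tokens per row
```

The same assumption would explain why constant columns such as datasetName are listed as the lowercase tokens `nmnh`, `extant`, and `biology`.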
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2928318 | |
| 1 | 1566388 | |
| 2 | 1372552 | |
| - | 1209440 | |
| : | 1209440 | |
| (space) | 604720 | 5.3% |
| 3 | 593130 | 5.2% |
| 5 | 494673 | 4.3% |
| 4 | 456568 | 4.0% |
| 9 | 314335 | 2.7% |
| Other values (3) | 740116 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8466080 | |
| Dash Punctuation | 1209440 | 10.5% |
| Other Punctuation | 1209440 | 10.5% |
| Space Separator | 604720 | 5.3% |
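The category names in this table correspond to Unicode general categories. A minimal sketch using Python's standard `unicodedata` module shows how one fixed-format timestamp yields these per-category counts; the label mapping and the `category_counts` helper are illustrative, and scaling by the row count assumes every value has the same 19-character layout, as the length statistics suggest:

```python
import unicodedata
from collections import Counter

# Unicode general-category codes mapped to the labels printed in the report
# (only the four categories that occur in this column).
CATEGORY_LABELS = {
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Zs": "Space Separator",
}

def category_counts(text):
    """Count characters of `text` per Unicode general category."""
    counts = Counter()
    for ch in text:
        code = unicodedata.category(ch)
        counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts

sample = "2013-09-16 11:56:00"  # first sample row of `modified`
per_value = category_counts(sample)
n_rows = 604_720

for label, n in per_value.items():
    # Per-value count, then the column total if every row has this layout.
    print(label, n, n * n_rows)
```

Multiplying the per-value counts by 604720 gives 8466080 decimal digits, 1209440 dashes, 1209440 colons, and 604720 spaces, matching the table.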
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2928318 | |
| 1 | 1566388 | |
| 2 | 1372552 | |
| 3 | 593130 | 7.0% |
| 5 | 494673 | 5.8% |
| 4 | 456568 | 5.4% |
| 9 | 314335 | 3.7% |
| 7 | 310958 | 3.7% |
| 6 | 238204 | 2.8% |
| 8 | 190954 | 2.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1209440 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1209440 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 604720 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11489680 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2928318 | |
| 1 | 1566388 | |
| 2 | 1372552 | |
| - | 1209440 | |
| : | 1209440 | |
| (space) | 604720 | 5.3% |
| 3 | 593130 | 5.2% |
| 5 | 494673 | 4.3% |
| 4 | 456568 | 4.0% |
| 9 | 314335 | 2.7% |
| Other values (3) | 740116 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11489680 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2928318 | |
| 1 | 1566388 | |
| 2 | 1372552 | |
| - | 1209440 | |
| : | 1209440 | |
| (space) | 604720 | 5.3% |
| 3 | 593130 | 5.2% |
| 5 | 494673 | 4.3% |
| 4 | 456568 | 4.0% |
| 9 | 314335 | 2.7% |
| Other values (3) | 740116 | 6.4% |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:34871 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 604720 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2418880 | |
| : | 2418880 | |
| l | 1814160 | 10.3% |
| i | 1209440 | 6.9% |
| r | 1209440 | 6.9% |
| c | 1209440 | 6.9% |
| g | 604720 | 3.4% |
| 7 | 604720 | 3.4% |
| 8 | 604720 | 3.4% |
| 4 | 604720 | 3.4% |
| Other values (8) | 4837760 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11489680 | |
| Other Punctuation | 3023600 | 17.2% |
| Decimal Number | 3023600 | 17.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2418880 | |
| l | 1814160 | |
| i | 1209440 | |
| r | 1209440 | |
| c | 1209440 | |
| g | 604720 | 5.3% |
| u | 604720 | 5.3% |
| b | 604720 | 5.3% |
| d | 604720 | 5.3% |
| s | 604720 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 604720 | |
| 8 | 604720 | |
| 4 | 604720 | |
| 3 | 604720 | |
| 1 | 604720 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2418880 | |
| . | 604720 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11489680 | |
| Common | 6047200 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2418880 | |
| l | 1814160 | |
| i | 1209440 | |
| r | 1209440 | |
| c | 1209440 | |
| g | 604720 | 5.3% |
| u | 604720 | 5.3% |
| b | 604720 | 5.3% |
| d | 604720 | 5.3% |
| s | 604720 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 2418880 | |
| 7 | 604720 | 10.0% |
| 8 | 604720 | 10.0% |
| 4 | 604720 | 10.0% |
| 3 | 604720 | 10.0% |
| . | 604720 | 10.0% |
| 1 | 604720 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17536880 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2418880 | |
| : | 2418880 | |
| l | 1814160 | 10.3% |
| i | 1209440 | 6.9% |
| r | 1209440 | 6.9% |
| c | 1209440 | 6.9% |
| g | 604720 | 3.4% |
| 7 | 604720 | 3.4% |
| 8 | 604720 | 3.4% |
| 4 | 604720 | 3.4% |
| Other values (8) | 4837760 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
|---|---|
| 2nd row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
| 3rd row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
| 4th row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
| 5th row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
| Value | Count | Frequency (%) |
| urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad | 604720 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3023600 | 11.1% |
| a | 2418880 | 8.9% |
| - | 2418880 | 8.9% |
| d | 1814160 | 6.7% |
| c | 1814160 | 6.7% |
| u | 1814160 | 6.7% |
| 8 | 1209440 | 4.4% |
| 3 | 1209440 | 4.4% |
| : | 1209440 | 4.4% |
| 9 | 1209440 | 4.4% |
| Other values (12) | 9070800 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12094400 | |
| Decimal Number | 11489680 | |
| Dash Punctuation | 2418880 | 8.9% |
| Other Punctuation | 1209440 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3023600 | |
| 8 | 1209440 | 10.5% |
| 3 | 1209440 | 10.5% |
| 9 | 1209440 | 10.5% |
| 6 | 1209440 | 10.5% |
| 2 | 1209440 | 10.5% |
| 1 | 604720 | 5.3% |
| 4 | 604720 | 5.3% |
| 7 | 604720 | 5.3% |
| 5 | 604720 | 5.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2418880 | |
| d | 1814160 | |
| c | 1814160 | |
| u | 1814160 | |
| b | 1209440 | |
| e | 604720 | 5.0% |
| i | 604720 | 5.0% |
| r | 604720 | 5.0% |
| n | 604720 | 5.0% |
| f | 604720 | 5.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2418880 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1209440 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15118000 | |
| Latin | 12094400 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3023600 | |
| - | 2418880 | |
| 8 | 1209440 | 8.0% |
| 3 | 1209440 | 8.0% |
| : | 1209440 | 8.0% |
| 9 | 1209440 | 8.0% |
| 6 | 1209440 | 8.0% |
| 2 | 1209440 | 8.0% |
| 1 | 604720 | 4.0% |
| 4 | 604720 | 4.0% |
| Other values (2) | 1209440 | 8.0% |
Latin
| Value | Count | Frequency (%) |
| a | 2418880 | |
| d | 1814160 | |
| c | 1814160 | |
| u | 1814160 | |
| b | 1209440 | |
| e | 604720 | 5.0% |
| i | 604720 | 5.0% |
| r | 604720 | 5.0% |
| n | 604720 | 5.0% |
| f | 604720 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27212400 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3023600 | 11.1% |
| a | 2418880 | 8.9% |
| - | 2418880 | 8.9% |
| d | 1814160 | 6.7% |
| c | 1814160 | 6.7% |
| u | 1814160 | 6.7% |
| 8 | 1209440 | 4.4% |
| 3 | 1209440 | 4.4% |
| : | 1209440 | 4.4% |
| 9 | 1209440 | 4.4% |
| Other values (12) | 9070800 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 604720 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 604720 | |
| S | 604720 | |
| N | 604720 | |
| M | 604720 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2418880 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 604720 | |
| S | 604720 | |
| N | 604720 | |
| M | 604720 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2418880 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 604720 | |
| S | 604720 | |
| N | 604720 | |
| M | 604720 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2418880 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 604720 | |
| S | 604720 | |
| N | 604720 | |
| M | 604720 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ENT |
|---|---|
| 2nd row | ENT |
| 3rd row | ENT |
| 4th row | ENT |
| 5th row | ENT |
| Value | Count | Frequency (%) |
| ent | 604720 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 604720 | |
| N | 604720 | |
| T | 604720 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1814160 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 604720 | |
| N | 604720 | |
| T | 604720 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1814160 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 604720 | |
| N | 604720 | |
| T | 604720 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1814160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 604720 | |
| N | 604720 | |
| T | 604720 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 604720 | |
| extant | 604720 | |
| biology | 604720 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1209440 | 10.5% |
| (space) | 1209440 | 10.5% |
| t | 1209440 | 10.5% |
| o | 1209440 | 10.5% |
| M | 604720 | 5.3% |
| H | 604720 | 5.3% |
| E | 604720 | 5.3% |
| x | 604720 | 5.3% |
| a | 604720 | 5.3% |
| n | 604720 | 5.3% |
| Other values (5) | 3023600 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6651920 | |
| Uppercase Letter | 3628320 | |
| Space Separator | 1209440 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1209440 | |
| o | 1209440 | |
| x | 604720 | |
| a | 604720 | |
| n | 604720 | |
| i | 604720 | |
| l | 604720 | |
| g | 604720 | |
| y | 604720 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1209440 | |
| M | 604720 | |
| H | 604720 | |
| E | 604720 | |
| B | 604720 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1209440 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10280240 | |
| Common | 1209440 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1209440 | |
| t | 1209440 | |
| o | 1209440 | |
| M | 604720 | 5.9% |
| H | 604720 | 5.9% |
| E | 604720 | 5.9% |
| x | 604720 | 5.9% |
| a | 604720 | 5.9% |
| n | 604720 | 5.9% |
| B | 604720 | 5.9% |
| Other values (4) | 2418880 |
Common
| Value | Count | Frequency (%) |
| (space) | 1209440 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11489680 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1209440 | 10.5% |
| (space) | 1209440 | 10.5% |
| t | 1209440 | 10.5% |
| o | 1209440 | 10.5% |
| M | 604720 | 5.3% |
| H | 604720 | 5.3% |
| E | 604720 | 5.3% |
| x | 604720 | 5.3% |
| a | 604720 | 5.3% |
| n | 604720 | 5.3% |
| Other values (5) | 3023600 |
basisOfRecord
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 16.99375083 |
| Min length | 16 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PreservedSpecimen |
|---|---|
| 2nd row | PreservedSpecimen |
| 3rd row | PreservedSpecimen |
| 4th row | PreservedSpecimen |
| 5th row | PreservedSpecimen |
| Value | Count | Frequency (%) |
| preservedspecimen | 600941 | |
| humanobservation | 3779 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3008484 | |
| r | 1205661 | |
| n | 608499 | 5.9% |
| i | 604720 | 5.9% |
| s | 604720 | 5.9% |
| v | 604720 | 5.9% |
| m | 604720 | 5.9% |
| c | 600941 | 5.8% |
| P | 600941 | 5.8% |
| p | 600941 | 5.8% |
| Other values (9) | 1232114 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9067021 | |
| Uppercase Letter | 1209440 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3008484 | |
| r | 1205661 | |
| n | 608499 | 6.7% |
| i | 604720 | 6.7% |
| s | 604720 | 6.7% |
| v | 604720 | 6.7% |
| m | 604720 | 6.7% |
| c | 600941 | 6.6% |
| p | 600941 | 6.6% |
| d | 600941 | 6.6% |
| Other values (5) | 22674 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 600941 | |
| S | 600941 | |
| H | 3779 | 0.3% |
| O | 3779 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10276461 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3008484 | |
| r | 1205661 | |
| n | 608499 | 5.9% |
| i | 604720 | 5.9% |
| s | 604720 | 5.9% |
| v | 604720 | 5.9% |
| m | 604720 | 5.9% |
| c | 600941 | 5.8% |
| P | 600941 | 5.8% |
| p | 600941 | 5.8% |
| Other values (9) | 1232114 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10276461 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3008484 | |
| r | 1205661 | |
| n | 608499 | 5.9% |
| i | 604720 | 5.9% |
| s | 604720 | 5.9% |
| v | 604720 | 5.9% |
| m | 604720 | 5.9% |
| c | 600941 | 5.8% |
| P | 600941 | 5.8% |
| p | 600941 | 5.8% |
| Other values (9) | 1232114 |
occurrenceID
Text
Unique 
| Distinct | 604720 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 604720 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/3c83a10d1-1e59-4b08-af5b-28d12d2d0c80 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/383bb510d-d5ce-4c09-b4c4-bc1482fbaf28 |
| 3rd row | http://n2t.net/ark:/65665/383f13aa6-a5b6-40bc-bddc-b42c557aebfc |
| 4th row | http://n2t.net/ark:/65665/383f4d560-c2d2-485c-906c-b6dad303fd7a |
| 5th row | http://n2t.net/ark:/65665/383f634da-bb58-423c-85f4-a267b04c5ee5 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/3c83a10d1-1e59-4b08-af5b-28d12d2d0c80 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c932a059-56b2-4846-9e97-741d7bdde29c | 1 | < 0.1% |
| http://n2t.net/ark:/65665/384cb9f0c-76d8-41b2-9a2e-351c10a4ab3f | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c94d744a-d127-4564-9b0c-5d349a138dd0 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/384c3715b-7768-468a-b76b-a68ff7a554d0 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c8c6462b-a9e9-4efa-9205-6fb4e5ef4e65 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383f13aa6-a5b6-40bc-bddc-b42c557aebfc | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383f4d560-c2d2-485c-906c-b6dad303fd7a | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383f634da-bb58-423c-85f4-a267b04c5ee5 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c898aee2-d463-49d7-ad9c-6fd423e170e1 | 1 | < 0.1% |
| Other values (604710) | 604710 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 3023600 | 7.9% |
| 6 | 2949751 | 7.7% |
| - | 2418880 | 6.3% |
| t | 2418880 | 6.3% |
| 5 | 2343491 | 6.2% |
| a | 1889528 | 5.0% |
| 2 | 1739197 | 4.6% |
| e | 1738583 | 4.6% |
| 3 | 1737642 | 4.6% |
| 4 | 1737535 | 4.6% |
| Other values (16) | 16100273 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16480712 | |
| Lowercase Letter | 14360008 | |
| Other Punctuation | 4837760 | 12.7% |
| Dash Punctuation | 2418880 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2418880 | |
| a | 1889528 | |
| e | 1738583 | |
| b | 1286170 | |
| n | 1209440 | |
| d | 1134463 | |
| c | 1133059 | |
| f | 1131005 | |
| k | 604720 | 4.2% |
| r | 604720 | 4.2% |
| Other values (2) | 1209440 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2949751 | |
| 5 | 2343491 | |
| 2 | 1739197 | |
| 3 | 1737642 | |
| 4 | 1737535 | |
| 8 | 1286386 | |
| 9 | 1284901 | |
| 0 | 1134229 | 6.9% |
| 1 | 1134029 | 6.9% |
| 7 | 1133551 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3023600 | |
| : | 1209440 | 25.0% |
| . | 604720 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2418880 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23737352 | |
| Latin | 14360008 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 3023600 | |
| 6 | 2949751 | |
| - | 2418880 | |
| 5 | 2343491 | |
| 2 | 1739197 | |
| 3 | 1737642 | |
| 4 | 1737535 | |
| 8 | 1286386 | 5.4% |
| 9 | 1284901 | 5.4% |
| : | 1209440 | 5.1% |
| Other values (4) | 4006529 |
Latin
| Value | Count | Frequency (%) |
| t | 2418880 | |
| a | 1889528 | |
| e | 1738583 | |
| b | 1286170 | |
| n | 1209440 | |
| d | 1134463 | |
| c | 1133059 | |
| f | 1131005 | |
| k | 604720 | 4.2% |
| r | 604720 | 4.2% |
| Other values (2) | 1209440 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38097360 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 3023600 | 7.9% |
| 6 | 2949751 | 7.7% |
| - | 2418880 | 6.3% |
| t | 2418880 | 6.3% |
| 5 | 2343491 | 6.2% |
| a | 1889528 | 5.0% |
| 2 | 1739197 | 4.6% |
| e | 1738583 | 4.6% |
| 3 | 1737642 | 4.6% |
| 4 | 1737535 | 4.6% |
| Other values (16) | 16100273 |
catalogNumber
Text
Missing 
| Distinct | 371254 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 233452 |
| Missing (%) | 38.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 15 |
| Mean length | 15.03873482 |
| Min length | 12 |
Unique
| Unique | 371240 |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | USNMENT00831303 |
|---|---|
| 2nd row | USNMENT00356408 |
| 3rd row | USNMENT01436172 |
| 4th row | USNMENT00357025 |
| 5th row | USNMENT00314717 |
| Value | Count | Frequency (%) |
| usnment00377587 | 2 | < 0.1% |
| usnment00381323 | 2 | < 0.1% |
| usnment00937212 | 2 | < 0.1% |
| usnment00377617 | 2 | < 0.1% |
| usnment00536541 | 2 | < 0.1% |
| usnment00533165 | 2 | < 0.1% |
| usnment00385557 | 2 | < 0.1% |
| usnment01200936 | 2 | < 0.1% |
| usnment00385731 | 2 | < 0.1% |
| usnment00937214 | 2 | < 0.1% |
| Other values (371244) | 371248 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 804733 | |
| N | 741878 | |
| 1 | 377023 | 6.8% |
| S | 371268 | 6.6% |
| U | 371224 | 6.6% |
| M | 371224 | 6.6% |
| E | 370648 | 6.6% |
| T | 370648 | 6.6% |
| 3 | 302855 | 5.4% |
| 4 | 225937 | 4.0% |
| Other values (11) | 1275963 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2982343 | |
| Uppercase Letter | 2596978 | |
| Other Punctuation | 4078 | 0.1% |
| Lowercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 804733 | |
| 1 | 377023 | |
| 3 | 302855 | 10.2% |
| 4 | 225937 | 7.6% |
| 2 | 225472 | 7.6% |
| 5 | 215981 | 7.2% |
| 8 | 215588 | 7.2% |
| 7 | 210834 | 7.1% |
| 6 | 202437 | 6.8% |
| 9 | 201483 | 6.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 741878 | |
| S | 371268 | |
| U | 371224 | |
| M | 371224 | |
| E | 370648 | |
| T | 370648 | |
| C | 44 | < 0.1% |
| A | 44 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 1 | |
| a | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4078 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2986421 | |
| Latin | 2596980 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 804733 | |
| 1 | 377023 | |
| 3 | 302855 | 10.1% |
| 4 | 225937 | 7.6% |
| 2 | 225472 | 7.5% |
| 5 | 215981 | 7.2% |
| 8 | 215588 | 7.2% |
| 7 | 210834 | 7.1% |
| 6 | 202437 | 6.8% |
| 9 | 201483 | 6.7% |
Latin
| Value | Count | Frequency (%) |
| N | 741878 | |
| S | 371268 | |
| U | 371224 | |
| M | 371224 | |
| E | 370648 | |
| T | 370648 | |
| C | 44 | < 0.1% |
| A | 44 | < 0.1% |
| b | 1 | < 0.1% |
| a | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5583401 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 804733 | |
| N | 741878 | |
| 1 | 377023 | 6.8% |
| S | 371268 | 6.6% |
| U | 371224 | 6.6% |
| M | 371224 | 6.6% |
| E | 370648 | 6.6% |
| T | 370648 | 6.6% |
| 3 | 302855 | 5.4% |
| 4 | 225937 | 4.0% |
| Other values (11) | 1275963 |
recordNumber
Text
Missing 
| Distinct | 33 |
|---|---|
| Distinct (%) | 89.2% |
| Missing | 604683 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 26 |
| Mean length | 17.13513514 |
| Min length | 4 |
Unique
| Unique | 32 |
|---|---|
| Unique (%) | 86.5% |
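"Distinct" and "Unique" measure different things: distinct counts the different values present, while unique counts the values that occur exactly once. For recordNumber, 604720 − 604683 = 37 values are present, and the reported percentages are consistent with using those 37 as the denominator. A minimal sketch with synthetic stand-in values (the real strings are not reproduced here, and `distinct_and_unique` is a hypothetical helper):

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique) counts, ignoring missing (None) entries."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)                                 # different values
    unique = sum(1 for c in counts.values() if c == 1)     # seen exactly once
    return distinct, unique

# Synthetic data shaped like the reported figures: 32 values that occur once
# plus one value that occurs five times, for 37 non-missing entries in total.
values = [f"value-{i}" for i in range(32)] + ["repeated"] * 5
present = len(values)

distinct, unique = distinct_and_unique(values)
print(distinct, f"{100 * distinct / present:.1f}%")
print(unique, f"{100 * unique / present:.1f}%")
```

With these inputs the sketch gives 33 distinct (89.2%) and 32 unique (86.5%), the same figures the report shows for this column.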
Sample
| 1st row | Collection number "14,957" |
|---|---|
| 2nd row | Lot 607, Sub 182 |
| 3rd row | 4012 |
| 4th row | Dognin Collection |
| 5th row | 12.097 |
| Value | Count | Frequency (%) |
| collection | 10 | 10.0% |
| no | 9 | 9.0% |
| walsingham | 7 | 7.0% |
| dognin | 5 | 5.0% |
| hopkins | 3 | 3.0% |
| quaintance | 2 | 2.0% |
| wlsm | 2 | 2.0% |
| townes | 2 | 2.0% |
| number | 2 | 2.0% |
| from | 2 | 2.0% |
| Other values (56) | 56 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 63 | 9.9% |
| o | 52 | 8.2% |
| n | 47 | 7.4% |
| l | 39 | 6.2% |
| i | 33 | 5.2% |
| . | 26 | 4.1% |
| e | 25 | 3.9% |
| a | 22 | 3.5% |
| t | 19 | 3.0% |
| 1 | 19 | 3.0% |
| Other values (47) | 289 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 348 | |
| Decimal Number | 114 | 18.0% |
| Uppercase Letter | 67 | 10.6% |
| Space Separator | 63 | 9.9% |
| Other Punctuation | 38 | 6.0% |
| Dash Punctuation | 2 | 0.3% |
| Open Punctuation | 1 | 0.2% |
| Close Punctuation | 1 | 0.2% |
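The category names in the table above match Unicode general categories. The sketch below shows one way such a tally could be reproduced with Python's standard `unicodedata` module; the `CATEGORY_LABELS` mapping and the `category_counts` helper are hypothetical names introduced here for illustration, and the report's own implementation may differ. The input string is the second sample row of recordNumber.

```python
import unicodedata
from collections import Counter

# Hypothetical mapping from Unicode general-category codes to the labels used in the report.
CATEGORY_LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not listed above.
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts

counts = category_counts(["Lot 607, Sub 182"])
for label, n in counts.most_common():
    print(label, n)
```

Passing every non-missing value of a column instead of a single string would give column-level totals comparable to the table.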
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 52 | |
| n | 47 | |
| l | 39 | |
| i | 33 | |
| e | 25 | 7.2% |
| a | 22 | 6.3% |
| t | 19 | 5.5% |
| c | 18 | 5.2% |
| s | 16 | 4.6% |
| g | 14 | 4.0% |
| Other values (11) | 63 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 14 | |
| W | 9 | |
| N | 9 | |
| H | 6 | |
| D | 5 | 7.5% |
| S | 4 | 6.0% |
| M | 3 | 4.5% |
| Q | 2 | 3.0% |
| T | 2 | 3.0% |
| U | 2 | 3.0% |
| Other values (9) | 11 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 19 | |
| 7 | 15 | |
| 0 | 14 | |
| 8 | 14 | |
| 5 | 12 | |
| 9 | 12 | |
| 4 | 8 | |
| 2 | 8 | |
| 6 | 7 | 6.1% |
| 3 | 5 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 26 | |
| " | 10 | 26.3% |
| , | 2 | 5.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 63 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 415 | |
| Common | 219 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 52 | 12.5% |
| n | 47 | 11.3% |
| l | 39 | 9.4% |
| i | 33 | 8.0% |
| e | 25 | 6.0% |
| a | 22 | 5.3% |
| t | 19 | 4.6% |
| c | 18 | 4.3% |
| s | 16 | 3.9% |
| C | 14 | 3.4% |
| Other values (30) | 130 |
Common
| Value | Count | Frequency (%) |
| (space) | 63 | |
| . | 26 | |
| 1 | 19 | 8.7% |
| 7 | 15 | 6.8% |
| 0 | 14 | 6.4% |
| 8 | 14 | 6.4% |
| 5 | 12 | 5.5% |
| 9 | 12 | 5.5% |
| " | 10 | 4.6% |
| 4 | 8 | 3.7% |
| Other values (7) | 26 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 634 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 63 | 9.9% |
| o | 52 | 8.2% |
| n | 47 | 7.4% |
| l | 39 | 6.2% |
| i | 33 | 5.2% |
| . | 26 | 4.1% |
| e | 25 | 3.9% |
| a | 22 | 3.5% |
| t | 19 | 3.0% |
| 1 | 19 | 3.0% |
| Other values (47) | 289 |
recordedBy
Text
Missing 
| Distinct | 18727 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 203369 |
| Missing (%) | 33.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 90 |
|---|---|
| Median length | 84 |
| Mean length | 11.25701693 |
| Min length | 1 |
Unique
| Unique | 9104 |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | M. Ortiz B. |
|---|---|
| 2nd row | [Not Stated] |
| 3rd row | S. Roble |
| 4th row | [Not Stated] |
| 5th row | C. Flint |
| Value | Count | Frequency (%) |
| not | 65723 | 7.2% |
| stated | 65707 | 7.2% |
| l | 40187 | 4.4% |
| (empty) | 39883 | 4.4% |
| j | 36893 | 4.0% |
| macior | 31234 | 3.4% |
| d | 28472 | 3.1% |
| c | 27158 | 3.0% |
| r | 25638 | 2.8% |
| b | 22051 | 2.4% |
| Other values (10691) | 530867 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 512462 | 11.3% |
| . | 355587 | 7.9% |
| t | 305186 | 6.8% |
| a | 299390 | 6.6% |
| e | 290122 | 6.4% |
| o | 240216 | 5.3% |
| r | 229316 | 5.1% |
| i | 173792 | 3.8% |
| n | 169878 | 3.8% |
| l | 136877 | 3.0% |
| Other values (73) | 1805189 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2588000 | |
| Uppercase Letter | 878362 | 19.4% |
| Space Separator | 512462 | 11.3% |
| Other Punctuation | 405464 | 9.0% |
| Open Punctuation | 65758 | 1.5% |
| Close Punctuation | 65758 | 1.5% |
| Dash Punctuation | 2190 | < 0.1% |
| Decimal Number | 21 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 305186 | |
| a | 299390 | |
| e | 290122 | |
| o | 240216 | |
| r | 229316 | |
| i | 173792 | 6.7% |
| n | 169878 | 6.6% |
| l | 136877 | 5.3% |
| d | 115279 | 4.5% |
| s | 95763 | 3.7% |
| Other values (25) | 532181 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 116412 | |
| M | 90630 | 10.3% |
| N | 79769 | 9.1% |
| B | 56916 | 6.5% |
| C | 54340 | 6.2% |
| L | 51918 | 5.9% |
| D | 47335 | 5.4% |
| J | 42565 | 4.8% |
| W | 40161 | 4.6% |
| G | 38221 | 4.4% |
| Other values (17) | 260095 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 5 | 5 | |
| 0 | 2 | 9.5% |
| 2 | 2 | 9.5% |
| 6 | 2 | 9.5% |
| 9 | 1 | 4.8% |
| 3 | 1 | 4.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 355587 | |
| & | 39874 | 9.8% |
| , | 9364 | 2.3% |
| ' | 622 | 0.2% |
| ? | 16 | < 0.1% |
| / | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 65747 | |
| ( | 10 | < 0.1% |
| { | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 65747 | |
| ) | 10 | < 0.1% |
| } | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 512462 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2190 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3466362 | |
| Common | 1051653 | 23.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 305186 | 8.8% |
| a | 299390 | 8.6% |
| e | 290122 | 8.4% |
| o | 240216 | 6.9% |
| r | 229316 | 6.6% |
| i | 173792 | 5.0% |
| n | 169878 | 4.9% |
| l | 136877 | 3.9% |
| S | 116412 | 3.4% |
| d | 115279 | 3.3% |
| Other values (52) | 1389894 |
Common
| Value | Count | Frequency (%) |
| (space) | 512462 | |
| . | 355587 | |
| [ | 65747 | 6.3% |
| ] | 65747 | 6.3% |
| & | 39874 | 3.8% |
| , | 9364 | 0.9% |
| - | 2190 | 0.2% |
| ' | 622 | 0.1% |
| ? | 16 | < 0.1% |
| ) | 10 | < 0.1% |
| Other values (11) | 34 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4517525 | |
| None | 490 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 512462 | 11.3% |
| . | 355587 | 7.9% |
| t | 305186 | 6.8% |
| a | 299390 | 6.6% |
| e | 290122 | 6.4% |
| o | 240216 | 5.3% |
| r | 229316 | 5.1% |
| i | 173792 | 3.8% |
| n | 169878 | 3.8% |
| l | 136877 | 3.0% |
| Other values (63) | 1804699 |
None
| Value | Count | Frequency (%) |
| ñ | 238 | |
| ü | 107 | |
| á | 95 | 19.4% |
| ä | 13 | 2.7% |
| é | 12 | 2.4% |
| ö | 12 | 2.4% |
| ó | 8 | 1.6% |
| Á | 2 | 0.4% |
| č | 2 | 0.4% |
| â | 1 | 0.2% |
individualCount
Text
| Distinct | 941 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3136 |
| Missing (%) | 0.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 1 |
| Mean length | 1.044863228 |
| Min length | 1 |
Unique
| Unique | 393 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 7 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 548305 | |
| 2 | 10273 | 1.7% |
| 3 | 6619 | 1.1% |
| 4 | 4295 | 0.7% |
| 5 | 2621 | 0.4% |
| 6 | 2340 | 0.4% |
| 7 | 1822 | 0.3% |
| 8 | 1527 | 0.3% |
| 10 | 1306 | 0.2% |
| 9 | 1254 | 0.2% |
| Other values (931) | 21222 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 560888 | |
| 2 | 17645 | 2.8% |
| 3 | 11801 | 1.9% |
| 4 | 8337 | 1.3% |
| 5 | 6511 | 1.0% |
| 0 | 6142 | 1.0% |
| 6 | 5349 | 0.9% |
| 7 | 4420 | 0.7% |
| 8 | 3992 | 0.6% |
| 9 | 3488 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 628573 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 560888 | |
| 2 | 17645 | 2.8% |
| 3 | 11801 | 1.9% |
| 4 | 8337 | 1.3% |
| 5 | 6511 | 1.0% |
| 0 | 6142 | 1.0% |
| 6 | 5349 | 0.9% |
| 7 | 4420 | 0.7% |
| 8 | 3992 | 0.6% |
| 9 | 3488 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 628573 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 560888 | |
| 2 | 17645 | 2.8% |
| 3 | 11801 | 1.9% |
| 4 | 8337 | 1.3% |
| 5 | 6511 | 1.0% |
| 0 | 6142 | 1.0% |
| 6 | 5349 | 0.9% |
| 7 | 4420 | 0.7% |
| 8 | 3992 | 0.6% |
| 9 | 3488 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 628573 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 560888 | |
| 2 | 17645 | 2.8% |
| 3 | 11801 | 1.9% |
| 4 | 8337 | 1.3% |
| 5 | 6511 | 1.0% |
| 0 | 6142 | 1.0% |
| 6 | 5349 | 0.9% |
| 7 | 4420 | 0.7% |
| 8 | 3992 | 0.6% |
| 9 | 3488 | 0.6% |
sex
Text
Missing 
| Distinct | 95 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 339511 |
| Missing (%) | 56.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 34 |
| Mean length | 5.351737686 |
| Min length | 4 |
Unique
| Unique | 24 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Worker |
|---|---|
| 2nd row | Male |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Male |
| Value | Count | Frequency (%) |
| male | 137835 | |
| female | 93225 | |
| unknown | 34039 | 12.4% |
| worker | 7022 | 2.6% |
| (empty) | 1487 | 0.5% |
| unable | 240 | 0.1% |
| to | 240 | 0.1% |
| determine | 240 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 332267 | |
| l | 231300 | |
| a | 231300 | |
| M | 120716 | 8.5% |
| m | 110584 | 7.8% |
| n | 102597 | 7.2% |
| F | 80595 | 5.7% |
| o | 41301 | 2.9% |
| k | 41061 | 2.9% |
| U | 34224 | 2.4% |
| Other values (13) | 93384 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1152779 | |
| Uppercase Letter | 242396 | 17.1% |
| Other Punctuation | 15035 | 1.1% |
| Space Separator | 9119 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 332267 | |
| l | 231300 | |
| a | 231300 | |
| m | 110584 | 9.6% |
| n | 102597 | 8.9% |
| o | 41301 | 3.6% |
| k | 41061 | 3.6% |
| w | 34200 | 3.0% |
| r | 14284 | 1.2% |
| f | 12630 | 1.1% |
| Other values (5) | 1255 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 120716 | |
| F | 80595 | |
| U | 34224 | 14.1% |
| W | 6861 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 13780 | |
| & | 1253 | 8.3% |
| , | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9119 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1395175 | |
| Common | 24154 | 1.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 332267 | |
| l | 231300 | |
| a | 231300 | |
| M | 120716 | 8.7% |
| m | 110584 | 7.9% |
| n | 102597 | 7.4% |
| F | 80595 | 5.8% |
| o | 41301 | 3.0% |
| k | 41061 | 2.9% |
| U | 34224 | 2.5% |
| Other values (9) | 69230 | 5.0% |
Common
| Value | Count | Frequency (%) |
| ; | 13780 | |
| (space) | 9119 | |
| & | 1253 | 5.2% |
| , | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1419329 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 332267 | |
| l | 231300 | |
| a | 231300 | |
| M | 120716 | 8.5% |
| m | 110584 | 7.8% |
| n | 102597 | 7.2% |
| F | 80595 | 5.7% |
| o | 41301 | 2.9% |
| k | 41061 | 2.9% |
| U | 34224 | 2.4% |
| Other values (13) | 93384 | 6.6% |
lifeStage
Text
Missing 
| Distinct | 178 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 174155 |
| Missing (%) | 28.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 5 |
| Mean length | 5.285092843 |
| Min length | 1 |
Unique
| Unique | 60 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Adult |
| 4th row | Adult |
| 5th row | Adult |
| Value | Count | Frequency (%) |
| adult | 425078 | |
| immature | 4871 | 1.1% |
| wings | 3368 | 0.8% |
| alate | 1659 | 0.4% |
| apterous | 1572 | 0.4% |
| pupa | 1198 | 0.3% |
| soldier | 1080 | 0.2% |
| worker | 1007 | 0.2% |
| larva | 943 | 0.2% |
| reproductive | 667 | 0.2% |
| Other values (46) | 2928 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 434662 | |
| u | 434253 | |
| l | 428547 | |
| d | 426929 | |
| A | 392238 | |
| a | 47160 | 2.1% |
| (space) | 13806 | 0.6% |
| e | 13577 | 0.6% |
| r | 11681 | 0.5% |
| m | 10485 | 0.5% |
| Other values (35) | 62238 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1844501 | |
| Uppercase Letter | 406785 | 17.9% |
| Space Separator | 13806 | 0.6% |
| Other Punctuation | 10463 | 0.5% |
| Open Punctuation | 10 | < 0.1% |
| Close Punctuation | 10 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 434662 | |
| u | 434253 | |
| l | 428547 | |
| d | 426929 | |
| a | 47160 | 2.6% |
| e | 13577 | 0.7% |
| r | 11681 | 0.6% |
| m | 10485 | 0.6% |
| i | 6614 | 0.4% |
| n | 5371 | 0.3% |
| Other values (12) | 25222 | 1.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 392238 | |
| I | 4857 | 1.2% |
| W | 4377 | 1.1% |
| P | 1253 | 0.3% |
| S | 1098 | 0.3% |
| L | 809 | 0.2% |
| R | 668 | 0.2% |
| U | 667 | 0.2% |
| N | 399 | 0.1% |
| T | 177 | < 0.1% |
| Other values (8) | 242 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 13806 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 10463 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 10 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 10 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2251286 | |
| Common | 24290 | 1.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 434662 | |
| u | 434253 | |
| l | 428547 | |
| d | 426929 | |
| A | 392238 | |
| a | 47160 | 2.1% |
| e | 13577 | 0.6% |
| r | 11681 | 0.5% |
| m | 10485 | 0.5% |
| i | 6614 | 0.3% |
| Other values (30) | 45140 | 2.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 13806 | |
| ; | 10463 | |
| [ | 10 | < 0.1% |
| ] | 10 | < 0.1% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2275576 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 434662 | |
| u | 434253 | |
| l | 428547 | |
| d | 426929 | |
| A | 392238 | |
| a | 47160 | 2.1% |
| (space) | 13806 | 0.6% |
| e | 13577 | 0.6% |
| r | 11681 | 0.5% |
| m | 10485 | 0.5% |
| Other values (35) | 62238 | 2.7% |
preparations
Text
Missing 
| Distinct | 272 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 42056 |
| Missing (%) | 7.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 93 |
|---|---|
| Median length | 6 |
| Mean length | 6.839828032 |
| Min length | 1 |
Unique
| Unique | 112 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Pinned |
|---|---|
| 2nd row | Pinned |
| 3rd row | Pinned |
| 4th row | Envelope |
| 5th row | Pinned |
| Value | Count | Frequency (%) |
| pinned | 389792 | |
| envelope | 114691 | 18.8% |
| slide | 65067 | 10.7% |
| vial | 9498 | 1.6% |
| ethanol | 6482 | 1.1% |
| section | 3747 | 0.6% |
| on | 3653 | 0.6% |
| (empty) | 3195 | 0.5% |
| ink | 3151 | 0.5% |
| pen | 3072 | 0.5% |
| Other values (93) | 7800 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 916570 | |
| e | 701224 | |
| i | 472718 | |
| d | 455956 | |
| P | 366246 | 9.5% |
| l | 199786 | 5.2% |
| p | 142808 | 3.7% |
| o | 133897 | 3.5% |
| v | 114853 | 3.0% |
| E | 112904 | 2.9% |
| Other values (48) | 231563 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3214548 | |
| Uppercase Letter | 553430 | 14.4% |
| Space Separator | 47484 | 1.2% |
| Other Punctuation | 32282 | 0.8% |
| Decimal Number | 781 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 916570 | |
| e | 701224 | |
| i | 472718 | |
| d | 455956 | |
| l | 199786 | 6.2% |
| p | 142808 | 4.4% |
| o | 133897 | 4.2% |
| v | 114853 | 3.6% |
| a | 18598 | 0.6% |
| s | 17527 | 0.5% |
| Other values (15) | 40611 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 366246 | |
| E | 112904 | 20.4% |
| S | 56010 | 10.1% |
| V | 9718 | 1.8% |
| I | 3164 | 0.6% |
| B | 2575 | 0.5% |
| R | 887 | 0.2% |
| M | 523 | 0.1% |
| C | 505 | 0.1% |
| D | 388 | 0.1% |
| Other values (10) | 510 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 28582 | |
| & | 3195 | 9.9% |
| % | 389 | 1.2% |
| . | 69 | 0.2% |
| , | 28 | 0.1% |
| / | 15 | < 0.1% |
| ? | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 389 | |
| 7 | 389 | |
| 2 | 1 | 0.1% |
| 3 | 1 | 0.1% |
| 9 | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 47484 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3767978 | |
| Common | 80547 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 916570 | |
| e | 701224 | |
| i | 472718 | |
| d | 455956 | |
| P | 366246 | 9.7% |
| l | 199786 | 5.3% |
| p | 142808 | 3.8% |
| o | 133897 | 3.6% |
| v | 114853 | 3.0% |
| E | 112904 | 3.0% |
| Other values (35) | 151016 | 4.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 47484 | |
| ; | 28582 | |
| & | 3195 | 4.0% |
| 5 | 389 | 0.5% |
| % | 389 | 0.5% |
| 7 | 389 | 0.5% |
| . | 69 | 0.1% |
| , | 28 | < 0.1% |
| / | 15 | < 0.1% |
| ? | 4 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3848525 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 916570 | |
| e | 701224 | |
| i | 472718 | |
| d | 455956 | |
| P | 366246 | 9.5% |
| l | 199786 | 5.2% |
| p | 142808 | 3.7% |
| o | 133897 | 3.5% |
| v | 114853 | 3.0% |
| E | 112904 | 2.9% |
| Other values (48) | 231563 | 6.0% |
associatedMedia
Text
Missing 
| Distinct | 214407 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 390092 |
| Missing (%) | 64.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 259 |
|---|---|
| Median length | 49 |
| Mean length | 52.23455467 |
| Min length | 48 |
Unique
| Unique | 214268 |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | https://collections.nmnh.si.edu/media/?i=16421668 |
|---|---|
| 2nd row | https://collections.nmnh.si.edu/media/?i=16411146 |
| 3rd row | https://collections.nmnh.si.edu/media/?i=16342640 |
| 4th row | https://collections.nmnh.si.edu/media/?i=16365128 |
| 5th row | https://collections.nmnh.si.edu/media/?i=16326001 |
| Value | Count | Frequency (%) |
| https://collections.nmnh.si.edu/media/?i=16612365 | 38 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=16556913 | 19 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=16558066 | 14 | < 0.1% |
| 16556913 | 12 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=16623013 | 10 | < 0.1% |
| 16574611 | 9 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=16945972 | 7 | < 0.1% |
| 16561531 | 7 | < 0.1% |
| 16556901 | 7 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=16947492 | 5 | < 0.1% |
| Other values (284058) | 287167 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 858512 | 7.7% |
| / | 858512 | 7.7% |
| e | 643884 | 5.7% |
| t | 643884 | 5.7% |
| s | 643884 | 5.7% |
| . | 643884 | 5.7% |
| n | 643884 | 5.7% |
| 1 | 468009 | 4.2% |
| l | 429256 | 3.8% |
| o | 429256 | 3.8% |
| Other values (21) | 4948033 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6653468 | |
| Decimal Number | 2265916 | 20.2% |
| Other Punctuation | 2004319 | 17.9% |
| Math Symbol | 214628 | 1.9% |
| Space Separator | 72667 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 858512 | |
| e | 643884 | |
| t | 643884 | |
| s | 643884 | |
| n | 643884 | |
| l | 429256 | 6.5% |
| o | 429256 | 6.5% |
| c | 429256 | 6.5% |
| m | 429256 | 6.5% |
| d | 429256 | 6.5% |
| Other values (4) | 1073140 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 468009 | |
| 6 | 265288 | |
| 3 | 251481 | |
| 4 | 215243 | |
| 0 | 211833 | |
| 9 | 201543 | |
| 7 | 178897 | 7.9% |
| 2 | 166354 | 7.3% |
| 5 | 164303 | 7.3% |
| 8 | 142965 | 6.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 858512 | |
| . | 643884 | |
| ? | 214628 | 10.7% |
| : | 214628 | 10.7% |
| ; | 72667 | 3.6% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 214628 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 72667 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6653468 | |
| Common | 4557530 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 858512 | |
| . | 643884 | |
| 1 | 468009 | |
| 6 | 265288 | 5.8% |
| 3 | 251481 | 5.5% |
| 4 | 215243 | 4.7% |
| = | 214628 | 4.7% |
| ? | 214628 | 4.7% |
| : | 214628 | 4.7% |
| 0 | 211833 | 4.6% |
| Other values (7) | 999396 |
Latin
| Value | Count | Frequency (%) |
| i | 858512 | |
| e | 643884 | |
| t | 643884 | |
| s | 643884 | |
| n | 643884 | |
| l | 429256 | 6.5% |
| o | 429256 | 6.5% |
| c | 429256 | 6.5% |
| m | 429256 | 6.5% |
| d | 429256 | 6.5% |
| Other values (4) | 1073140 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11210998 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 858512 | 7.7% |
| / | 858512 | 7.7% |
| e | 643884 | 5.7% |
| t | 643884 | 5.7% |
| s | 643884 | 5.7% |
| . | 643884 | 5.7% |
| n | 643884 | 5.7% |
| 1 | 468009 | 4.2% |
| l | 429256 | 3.8% |
| o | 429256 | 3.8% |
| Other values (21) | 4948033 |
Missing 
| Distinct | 31235 |
|---|---|
| Distinct (%) | 21.5% |
| Missing | 459346 |
| Missing (%) | 76.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 151446 |
|---|---|
| Median length | 89176 |
| Mean length | 77.46756641 |
| Min length | 1 |
Unique
| Unique | 27503 |
|---|---|
| Unique (%) | 18.9% |
Sample
| 1st row | One leg removed for genetic sampling while on loan to GUELPH. |
|---|---|
| 2nd row | Lindroth, 1975:125: (the loc. is no doubt wrong). |
| 3rd row | F. Monros Coll. 1959 G.M. Greene Coll. C. Schaeffer Coll. Shoemaker Coll. 1956 A. Nicolay Coll. 1950 L.W. Saylor Coll. |
| 4th row | Specimen data is incomplete. Phase 1 of data capture inlcluded USNMENT#s and general locality. |
| 5th row | One leg removed for genetic sampling while on loan to GUELPH. |
| Value | Count | Frequency (%) |
| digitization | 56218 | 3.4% |
| by | 48162 | 2.9% |
| digital | 44075 | 2.7% |
| transcribed | 44039 | 2.7% |
| volunteers | 44039 | 2.7% |
| of | 42600 | 2.6% |
| on | 41034 | 2.5% |
| to | 36795 | 2.2% |
| loan | 36495 | 2.2% |
| for | 36258 | 2.2% |
| Other values (46961) | 1230406 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1496225 | 13.3% |
| e | 833183 | 7.4% |
| i | 803329 | 7.1% |
| a | 671754 | 6.0% |
| t | 666687 | 5.9% |
| o | 651617 | 5.8% |
| n | 613739 | 5.4% |
| r | 553298 | 4.9% |
| s | 447996 | 4.0% |
| l | 427415 | 3.8% |
| Other values (110) | 4096527 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7971658 | |
| Space Separator | 1496225 | 13.3% |
| Uppercase Letter | 1027865 | 9.1% |
| Other Punctuation | 295394 | 2.6% |
| Decimal Number | 259592 | 2.3% |
| Control | 101184 | 0.9% |
| Open Punctuation | 39698 | 0.4% |
| Close Punctuation | 39677 | 0.4% |
| Dash Punctuation | 18103 | 0.2% |
| Math Symbol | 12131 | 0.1% |
| Other values (7) | 243 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 833183 | |
| i | 803329 | |
| a | 671754 | 8.4% |
| t | 666687 | 8.4% |
| o | 651617 | 8.2% |
| n | 613739 | 7.7% |
| r | 553298 | 6.9% |
| s | 447996 | 5.6% |
| l | 427415 | 5.4% |
| d | 318849 | 4.0% |
| Other values (26) | 1983791 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 116768 | |
| O | 101082 | 9.8% |
| S | 101033 | 9.8% |
| E | 82579 | 8.0% |
| D | 70631 | 6.9% |
| I | 64203 | 6.2% |
| T | 62905 | 6.1% |
| M | 62182 | 6.0% |
| U | 54826 | 5.3% |
| L | 50803 | 4.9% |
| Other values (19) | 260853 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 173902 | |
| ; | 47091 | 15.9% |
| , | 31645 | 10.7% |
| : | 15772 | 5.3% |
| # | 9313 | 3.2% |
| / | 7124 | 2.4% |
| ' | 5317 | 1.8% |
| " | 2639 | 0.9% |
| & | 1678 | 0.6% |
| ? | 818 | 0.3% |
| Other values (7) | 95 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 55149 | |
| 9 | 38333 | |
| 0 | 29746 | |
| 2 | 27054 | |
| 6 | 20186 | 7.8% |
| 3 | 19885 | 7.7% |
| 5 | 19160 | 7.4% |
| 8 | 17051 | 6.6% |
| 7 | 16622 | 6.4% |
| 4 | 16406 | 6.3% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 10552 | |
| = | 836 | 6.9% |
| + | 720 | 5.9% |
| > | 11 | 0.1% |
| ~ | 8 | 0.1% |
| < | 4 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 17 | |
| ♂ | 14 | |
| ♀ | 4 | 11.1% |
| © | 1 | 2.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 33578 | |
| [ | 6109 | 15.4% |
| { | 11 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 33570 | |
| ] | 6096 | 15.4% |
| } | 11 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 100652 | |
| (control character) | 532 | 0.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18102 | |
| — | 1 | < 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 | |
| £ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1496225 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 149 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 23 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 23 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 9 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʼ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8999521 | |
| Common | 2262249 | 20.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 833183 | 9.3% |
| i | 803329 | 8.9% |
| a | 671754 | 7.5% |
| t | 666687 | 7.4% |
| o | 651617 | 7.2% |
| n | 613739 | 6.8% |
| r | 553298 | 6.1% |
| s | 447996 | 5.0% |
| l | 427415 | 4.7% |
| d | 318849 | 3.5% |
| Other values (54) | 3011654 |
Common
| Value | Count | Frequency (%) |
| (space) | 1496225 | |
| . | 173902 | 7.7% |
| (control character) | 100652 | 4.4% |
| 1 | 55149 | 2.4% |
| ; | 47091 | 2.1% |
| 9 | 38333 | 1.7% |
| ( | 33578 | 1.5% |
| ) | 33570 | 1.5% |
| , | 31645 | 1.4% |
| 0 | 29746 | 1.3% |
| Other values (46) | 222358 | 9.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11261640 | |
| None | 62 | < 0.1% |
| Punctuation | 49 | < 0.1% |
| Misc Symbols | 18 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1496225 | 13.3% |
| e | 833183 | 7.4% |
| i | 803329 | 7.1% |
| a | 671754 | 6.0% |
| t | 666687 | 5.9% |
| o | 651617 | 5.8% |
| n | 613739 | 5.4% |
| r | 553298 | 4.9% |
| s | 447996 | 4.0% |
| l | 427415 | 3.8% |
| Other values (85) | 4096397 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 23 | |
| ” | 23 | |
| … | 2 | 4.1% |
| — | 1 | 2.0% |
None
| Value | Count | Frequency (%) |
| ° | 17 | |
| · | 7 | |
| á | 6 | 9.7% |
| é | 4 | 6.5% |
| ó | 4 | 6.5% |
| ö | 4 | 6.5% |
| ø | 3 | 4.8% |
| í | 3 | 4.8% |
| µ | 2 | 3.2% |
| ü | 2 | 3.2% |
| Other values (8) | 10 |
Misc Symbols
| Value | Count | Frequency (%) |
| ♂ | 14 | |
| ♀ | 4 | 22.2% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʼ | 1 |
organismID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 70 21'9"W |
|---|
| Value | Count | Frequency (%) |
| 70 | 1 | |
| 21'9"w | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 1 | |
| 0 | 1 | |
| (space) | 1 | |
| 2 | 1 | |
| 1 | 1 | |
| ' | 1 | |
| 9 | 1 | |
| " | 1 | |
| W | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5 | |
| Other Punctuation | 2 | 22.2% |
| Space Separator | 1 | 11.1% |
| Uppercase Letter | 1 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 1 | |
| 0 | 1 | |
| 2 | 1 | |
| 1 | 1 | |
| 9 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1 | |
| " | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 | |
| Latin | 1 | 11.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 1 | |
| 0 | 1 | |
| (space) | 1 | |
| 2 | 1 | |
| 1 | 1 | |
| ' | 1 | |
| 9 | 1 | |
| " | 1 |
Latin
| Value | Count | Frequency (%) |
| W | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 1 | |
| 0 | 1 | |
| (space) | 1 | |
| 2 | 1 | |
| 1 | 1 | |
| ' | 1 | |
| 9 | 1 | |
| " | 1 | |
| W | 1 |
eventType
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -11.7815 |
|---|
| Value | Count | Frequency (%) |
| 11.7815 | 1 |
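The word table above lists `11.7815` for the sample value `-11.7815`, and the recordedBy word table lists `not` and `stated` for `[Not Stated]`. Both are consistent with lowercasing each value, splitting on whitespace, and stripping punctuation from the ends of each token. The sketch below illustrates that behaviour; the `tokenize` helper is a hypothetical name, and the rule itself is an assumption inferred from the tables rather than a documented part of the report.

```python
import string

# Assumption: tokens are lowercased, split on whitespace, and stripped of
# leading/trailing punctuation. Punctuation inside a token is left alone.
STRIP_CHARS = string.punctuation + string.whitespace

def tokenize(value: str) -> list[str]:
    """Split a field value into report-style word tokens."""
    return [token.strip(STRIP_CHARS) for token in value.lower().split()]

print(tokenize("-11.7815"))
print(tokenize("[Not Stated]"))
print(tokenize("&"))
```

Under this rule a token made only of punctuation, such as a standalone `&`, reduces to an empty string, which may explain word-table rows whose Value cell is blank.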
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3 | |
| - | 1 | 12.5% |
| . | 1 | 12.5% |
| 7 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| 5 | 1 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Dash Punctuation | 1 | 12.5% |
| Other Punctuation | 1 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 7 | 1 | 16.7% |
| 8 | 1 | 16.7% |
| 5 | 1 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3 | |
| - | 1 | 12.5% |
| . | 1 | 12.5% |
| 7 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| 5 | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3 | |
| - | 1 | 12.5% |
| . | 1 | 12.5% |
| 7 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| 5 | 1 | 12.5% |
fieldNumber
Text
Missing 
| Distinct | 3093 |
|---|---|
| Distinct (%) | 72.7% |
| Missing | 600468 |
| Missing (%) | 99.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 9.591251176 |
| Min length | 1 |
Unique
| Unique | 2648 |
|---|---|
| Unique (%) | 62.3% |
Sample
| 1st row | BBB991 |
|---|---|
| 2nd row | BBB642-DERM |
| 3rd row | 1653 |
| 4th row | JSL021109-18 |
| 5th row | COL-8-101 |
| Value | Count | Frequency (%) |
| 1653 | 128 | 2.8% |
| 2 | 46 | 1.0% |
| bbb899-hym | 34 | 0.7% |
| 1 | 32 | 0.7% |
| bbb791-hym | 26 | 0.6% |
| bbb749-hym | 23 | 0.5% |
| 759-8 | 22 | 0.5% |
| tub | 20 | 0.4% |
| tank | 18 | 0.4% |
| 9 | 18 | 0.4% |
| Other values (3089) | 4227 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 4784 | 11.7% |
| 0 | 3997 | 9.8% |
| - | 3980 | 9.8% |
| 1 | 3402 | 8.3% |
| 2 | 2239 | 5.5% |
| 3 | 1558 | 3.8% |
| 6 | 1542 | 3.8% |
| 7 | 1514 | 3.7% |
| 4 | 1498 | 3.7% |
| 9 | 1482 | 3.6% |
| Other values (60) | 14786 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19503 | |
| Uppercase Letter | 15056 | |
| Dash Punctuation | 3980 | 9.8% |
| Lowercase Letter | 1242 | 3.0% |
| Other Punctuation | 655 | 1.6% |
| Space Separator | 342 | 0.8% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 4784 | |
| S | 1388 | 9.2% |
| T | 1136 | 7.5% |
| C | 792 | 5.3% |
| M | 764 | 5.1% |
| A | 708 | 4.7% |
| L | 667 | 4.4% |
| R | 639 | 4.2% |
| N | 583 | 3.9% |
| H | 533 | 3.5% |
| Other values (15) | 3062 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 146 | |
| a | 138 | |
| o | 134 | |
| t | 118 | 9.5% |
| b | 82 | 6.6% |
| n | 81 | 6.5% |
| r | 67 | 5.4% |
| m | 57 | 4.6% |
| c | 57 | 4.6% |
| i | 55 | 4.4% |
| Other values (13) | 307 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3997 | |
| 1 | 3402 | |
| 2 | 2239 | |
| 3 | 1558 | 8.0% |
| 6 | 1542 | 7.9% |
| 7 | 1514 | 7.8% |
| 4 | 1498 | 7.7% |
| 9 | 1482 | 7.6% |
| 5 | 1174 | 6.0% |
| 8 | 1097 | 5.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 344 | |
| . | 200 | |
| ; | 93 | 14.2% |
| , | 10 | 1.5% |
| ' | 3 | 0.5% |
| " | 3 | 0.5% |
| / | 1 | 0.2% |
| : | 1 | 0.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3980 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 342 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 24484 | 60.0% |
| Latin | 16298 | 40.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 4784 | |
| S | 1388 | 8.5% |
| T | 1136 | 7.0% |
| C | 792 | 4.9% |
| M | 764 | 4.7% |
| A | 708 | 4.3% |
| L | 667 | 4.1% |
| R | 639 | 3.9% |
| N | 583 | 3.6% |
| H | 533 | 3.3% |
| Other values (38) | 4304 |
Common
| Value | Count | Frequency (%) |
| 0 | 3997 | |
| - | 3980 | |
| 1 | 3402 | |
| 2 | 2239 | |
| 3 | 1558 | 6.4% |
| 6 | 1542 | 6.3% |
| 7 | 1514 | 6.2% |
| 4 | 1498 | 6.1% |
| 9 | 1482 | 6.1% |
| 5 | 1174 | 4.8% |
| Other values (12) | 2098 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40782 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 4784 | 11.7% |
| 0 | 3997 | 9.8% |
| - | 3980 | 9.8% |
| 1 | 3402 | 8.3% |
| 2 | 2239 | 5.5% |
| 3 | 1558 | 3.8% |
| 6 | 1542 | 3.8% |
| 7 | 1514 | 3.7% |
| 4 | 1498 | 3.7% |
| 9 | 1482 | 3.6% |
| Other values (60) | 14786 |
eventDate
Text
Missing 
| Distinct | 46148 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 239420 |
| Missing (%) | 39.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 10 |
| Mean length | 11.06884752 |
| Min length | 4 |
Unique
| Unique | 13232 |
|---|---|
| Unique (%) | 3.6% |
Sample
| 1st row | 1967-06-20 |
|---|---|
| 2nd row | 1914-07 |
| 3rd row | 2005-08-02 |
| 4th row | 1964-04-25 |
| 5th row | 1971-08-22 |
| Value | Count | Frequency (%) |
| 1998-07-26 | 709 | 0.2% |
| 1938 | 574 | 0.2% |
| 2006-06-24 | 544 | 0.1% |
| 1933 | 524 | 0.1% |
| 1960-06-30 | 506 | 0.1% |
| 1936 | 472 | 0.1% |
| 1927-07-10 | 469 | 0.1% |
| 1964-08-01/1964-08-31 | 449 | 0.1% |
| 1930 | 435 | 0.1% |
| 1966-06-23 | 407 | 0.1% |
| Other values (46130) | 360245 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 782638 | |
| 1 | 702348 | |
| 0 | 653087 | |
| 9 | 492965 | |
| 2 | 288127 | 7.1% |
| 6 | 225522 | 5.6% |
| 7 | 216691 | 5.4% |
| 8 | 183523 | 4.5% |
| 5 | 159737 | 4.0% |
| 3 | 155914 | 3.9% |
| Other values (6) | 182898 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3213904 | |
| Dash Punctuation | 782638 | 19.4% |
| Other Punctuation | 46840 | 1.2% |
| Space Separator | 34 | < 0.1% |
| Lowercase Letter | 34 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 702348 | |
| 0 | 653087 | |
| 9 | 492965 | |
| 2 | 288127 | |
| 6 | 225522 | 7.0% |
| 7 | 216691 | 6.7% |
| 8 | 183523 | 5.7% |
| 5 | 159737 | 5.0% |
| 3 | 155914 | 4.9% |
| 4 | 135990 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 46785 | |
| , | 55 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 17 | 50.0% |
| r | 17 | 50.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 782638 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 34 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4043416 | |
| Latin | 34 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 782638 | |
| 1 | 702348 | |
| 0 | 653087 | |
| 9 | 492965 | |
| 2 | 288127 | 7.1% |
| 6 | 225522 | 5.6% |
| 7 | 216691 | 5.4% |
| 8 | 183523 | 4.5% |
| 5 | 159737 | 4.0% |
| 3 | 155914 | 3.9% |
| Other values (4) | 182864 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| o | 17 | 50.0% |
| r | 17 | 50.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4043450 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 782638 | |
| 1 | 702348 | |
| 0 | 653087 | |
| 9 | 492965 | |
| 2 | 288127 | 7.1% |
| 6 | 225522 | 5.6% |
| 7 | 216691 | 5.4% |
| 8 | 183523 | 4.5% |
| 5 | 159737 | 4.0% |
| 3 | 155914 | 3.9% |
| Other values (6) | 182898 | 4.5% |
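The eventDate values sampled above mix several ISO 8601 precisions: a bare year (`1938`), a year and month (`1914-07`), a full date (`1967-06-20`), and a slash-separated interval (`1964-08-01/1964-08-31`). A minimal sketch of how those shapes could be told apart before any date parsing is shown below. The helper name `classify_event_date` and the label strings are illustrative assumptions, not part of the profiled dataset or of any library.

```python
import re

# Hypothetical shape labels for single (non-interval) eventDate values.
_SHAPES = [
    ("year", re.compile(r"[0-9]{4}")),
    ("year-month", re.compile(r"[0-9]{4}-[0-9]{2}")),
    ("date", re.compile(r"[0-9]{4}-[0-9]{2}-[0-9]{2}")),
]


def _simple_shape(text: str) -> str:
    """Label one date-like token, or return 'other' if no pattern fits."""
    for label, pattern in _SHAPES:
        if pattern.fullmatch(text):
            return label
    return "other"


def classify_event_date(value: str) -> str:
    """Return 'year', 'year-month', 'date', 'interval', or 'other'."""
    text = value.strip()
    parts = text.split("/")
    if len(parts) == 1:
        return _simple_shape(text)
    # An interval needs exactly two recognisable endpoints.
    if len(parts) == 2 and all(_simple_shape(p) != "other" for p in parts):
        return "interval"
    return "other"
```

Anything labelled `other` (free text, for example) would need manual review; the sketch checks only the shape of the string, not whether the date exists on the calendar.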
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 244789 |
| Missing (%) | 40.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.849043289 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 171 |
|---|---|
| 2nd row | 212 |
| 3rd row | 214 |
| 4th row | 116 |
| 5th row | 234 |
| Value | Count | Frequency (%) |
| 212 | 4210 | 1.2% |
| 213 | 4014 | 1.1% |
| 182 | 3947 | 1.1% |
| 181 | 3445 | 1.0% |
| 151 | 3112 | 0.9% |
| 152 | 2941 | 0.8% |
| 183 | 2913 | 0.8% |
| 191 | 2887 | 0.8% |
| 207 | 2741 | 0.8% |
| 178 | 2632 | 0.7% |
| Other values (356) | 327089 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 238127 | |
| 2 | 202525 | |
| 3 | 100701 | |
| 9 | 70985 | 6.9% |
| 0 | 70947 | 6.9% |
| 4 | 70160 | 6.8% |
| 5 | 69390 | 6.8% |
| 8 | 68439 | 6.7% |
| 6 | 67900 | 6.6% |
| 7 | 66285 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1025459 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 238127 | |
| 2 | 202525 | |
| 3 | 100701 | |
| 9 | 70985 | 6.9% |
| 0 | 70947 | 6.9% |
| 4 | 70160 | 6.8% |
| 5 | 69390 | 6.8% |
| 8 | 68439 | 6.7% |
| 6 | 67900 | 6.6% |
| 7 | 66285 | 6.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1025459 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 238127 | |
| 2 | 202525 | |
| 3 | 100701 | |
| 9 | 70985 | 6.9% |
| 0 | 70947 | 6.9% |
| 4 | 70160 | 6.8% |
| 5 | 69390 | 6.8% |
| 8 | 68439 | 6.7% |
| 6 | 67900 | 6.6% |
| 7 | 66285 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1025459 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 238127 | |
| 2 | 202525 | |
| 3 | 100701 | |
| 9 | 70985 | 6.9% |
| 0 | 70947 | 6.9% |
| 4 | 70160 | 6.8% |
| 5 | 69390 | 6.8% |
| 8 | 68439 | 6.7% |
| 6 | 67900 | 6.6% |
| 7 | 66285 | 6.5% |
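Several startDayOfYear samples above (171, 214, 116, 234) appear to correspond to the full-date eventDate samples (1967-06-20, 2005-08-02, 1964-04-25, 1971-08-22). A minimal sketch of that cross-check, using only the Python standard library, follows. The function name `day_of_year` is an illustrative assumption, and the sketch handles only full `YYYY-MM-DD` dates.

```python
from datetime import date


def day_of_year(iso_date: str) -> int:
    """Return the 1-based ordinal day within the year for a YYYY-MM-DD string."""
    return date.fromisoformat(iso_date).timetuple().tm_yday
```

Comparing this value against the stored startDayOfYear for records with a full date is one way to spot rows where the two fields disagree.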
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 244303 |
| Missing (%) | 40.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.857215392 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 171 |
|---|---|
| 2nd row | 212 |
| 3rd row | 214 |
| 4th row | 116 |
| 5th row | 234 |
| Value | Count | Frequency (%) |
| 212 | 4994 | 1.4% |
| 181 | 4276 | 1.2% |
| 213 | 3666 | 1.0% |
| 151 | 3533 | 1.0% |
| 182 | 3365 | 0.9% |
| 243 | 3191 | 0.9% |
| 207 | 2999 | 0.8% |
| 191 | 2952 | 0.8% |
| 197 | 2774 | 0.8% |
| 120 | 2623 | 0.7% |
| Other values (356) | 326044 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 236813 | |
| 2 | 202617 | |
| 3 | 102147 | |
| 0 | 72053 | 7.0% |
| 9 | 71767 | 7.0% |
| 4 | 70263 | 6.8% |
| 5 | 69867 | 6.8% |
| 6 | 68660 | 6.7% |
| 7 | 67928 | 6.6% |
| 8 | 67674 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1029789 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 236813 | |
| 2 | 202617 | |
| 3 | 102147 | |
| 0 | 72053 | 7.0% |
| 9 | 71767 | 7.0% |
| 4 | 70263 | 6.8% |
| 5 | 69867 | 6.8% |
| 6 | 68660 | 6.7% |
| 7 | 67928 | 6.6% |
| 8 | 67674 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1029789 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 236813 | |
| 2 | 202617 | |
| 3 | 102147 | |
| 0 | 72053 | 7.0% |
| 9 | 71767 | 7.0% |
| 4 | 70263 | 6.8% |
| 5 | 69867 | 6.8% |
| 6 | 68660 | 6.7% |
| 7 | 67928 | 6.6% |
| 8 | 67674 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1029789 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 236813 | |
| 2 | 202617 | |
| 3 | 102147 | |
| 0 | 72053 | 7.0% |
| 9 | 71767 | 7.0% |
| 4 | 70263 | 6.8% |
| 5 | 69867 | 6.8% |
| 6 | 68660 | 6.7% |
| 7 | 67928 | 6.6% |
| 8 | 67674 | 6.6% |
year
Text
Missing 
| Distinct | 191 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 239420 |
| Missing (%) | 39.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 16 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1967 |
|---|---|
| 2nd row | 1914 |
| 3rd row | 2005 |
| 4th row | 1964 |
| 5th row | 1971 |
| Value | Count | Frequency (%) |
| 1966 | 12313 | 3.4% |
| 1968 | 9194 | 2.5% |
| 1971 | 8970 | 2.5% |
| 1967 | 8361 | 2.3% |
| 1965 | 7882 | 2.2% |
| 1972 | 6275 | 1.7% |
| 1964 | 6152 | 1.7% |
| 1974 | 6096 | 1.7% |
| 1973 | 6078 | 1.7% |
| 1963 | 5563 | 1.5% |
| Other values (181) | 288416 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 398215 | |
| 9 | 381833 | |
| 6 | 108781 | 7.4% |
| 0 | 108045 | 7.4% |
| 2 | 92925 | 6.4% |
| 7 | 89271 | 6.1% |
| 8 | 74906 | 5.1% |
| 5 | 72462 | 5.0% |
| 3 | 69682 | 4.8% |
| 4 | 65080 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1461200 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 398215 | |
| 9 | 381833 | |
| 6 | 108781 | 7.4% |
| 0 | 108045 | 7.4% |
| 2 | 92925 | 6.4% |
| 7 | 89271 | 6.1% |
| 8 | 74906 | 5.1% |
| 5 | 72462 | 5.0% |
| 3 | 69682 | 4.8% |
| 4 | 65080 | 4.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1461200 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 398215 | |
| 9 | 381833 | |
| 6 | 108781 | 7.4% |
| 0 | 108045 | 7.4% |
| 2 | 92925 | 6.4% |
| 7 | 89271 | 6.1% |
| 8 | 74906 | 5.1% |
| 5 | 72462 | 5.0% |
| 3 | 69682 | 4.8% |
| 4 | 65080 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1461200 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 398215 | |
| 9 | 381833 | |
| 6 | 108781 | 7.4% |
| 0 | 108045 | 7.4% |
| 2 | 92925 | 6.4% |
| 7 | 89271 | 6.1% |
| 8 | 74906 | 5.1% |
| 5 | 72462 | 5.0% |
| 3 | 69682 | 4.8% |
| 4 | 65080 | 4.5% |
month
Text
Missing 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 246636 |
| Missing (%) | 40.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 1 |
| Mean length | 1.113249964 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 6 |
|---|---|
| 2nd row | 7 |
| 3rd row | 8 |
| 4th row | 4 |
| 5th row | 8 |
| Value | Count | Frequency (%) |
| 7 | 74085 | |
| 6 | 58953 | |
| 8 | 51938 | |
| 5 | 36241 | |
| 9 | 26043 | 7.3% |
| 4 | 25759 | 7.2% |
| 3 | 16892 | 4.7% |
| 10 | 16541 | 4.6% |
| 2 | 14421 | 4.0% |
| 11 | 13740 | 3.8% |
| Other values (4) | 23473 | 6.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 74085 | |
| 1 | 67492 | |
| 6 | 58954 | |
| 8 | 51939 | |
| 5 | 36241 | |
| 9 | 26044 | 6.5% |
| 4 | 25760 | 6.5% |
| 2 | 24685 | 6.2% |
| 3 | 16892 | 4.2% |
| 0 | 16541 | 4.1% |
| Other values (3) | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 398633 | |
| Space Separator | 2 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 74085 | |
| 1 | 67492 | |
| 6 | 58954 | |
| 8 | 51939 | |
| 5 | 36241 | |
| 9 | 26044 | 6.5% |
| 4 | 25760 | 6.5% |
| 2 | 24685 | 6.2% |
| 3 | 16892 | 4.2% |
| 0 | 16541 | 4.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 398636 | |
| Latin | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 74085 | |
| 1 | 67492 | |
| 6 | 58954 | |
| 8 | 51939 | |
| 5 | 36241 | |
| 9 | 26044 | 6.5% |
| 4 | 25760 | 6.5% |
| 2 | 24685 | 6.2% |
| 3 | 16892 | 4.2% |
| 0 | 16541 | 4.1% |
| Other values (2) | 3 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| S | 1 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 398637 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 74085 | |
| 1 | 67492 | |
| 6 | 58954 | |
| 8 | 51939 | |
| 5 | 36241 | |
| 9 | 26044 | 6.5% |
| 4 | 25760 | 6.5% |
| 2 | 24685 | 6.2% |
| 3 | 16892 | 4.2% |
| 0 | 16541 | 4.1% |
| Other values (3) | 4 | < 0.1% |
day
Text
Missing 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 270887 |
| Missing (%) | 44.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 2 |
| Mean length | 1.683176918 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 20 |
|---|---|
| 2nd row | 2 |
| 3rd row | 25 |
| 4th row | 22 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 1 | 20555 | 6.2% |
| 8 | 13029 | 3.9% |
| 20 | 12179 | 3.6% |
| 10 | 11989 | 3.6% |
| 15 | 11884 | 3.6% |
| 12 | 11876 | 3.6% |
| 25 | 11249 | 3.4% |
| 6 | 11148 | 3.3% |
| 16 | 11145 | 3.3% |
| 23 | 10866 | 3.3% |
| Other values (24) | 207915 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 155576 | |
| 2 | 137768 | |
| 3 | 45163 | 8.0% |
| 8 | 33226 | 5.9% |
| 0 | 33162 | 5.9% |
| 5 | 33152 | 5.9% |
| 6 | 32821 | 5.8% |
| 4 | 31502 | 5.6% |
| 7 | 30851 | 5.5% |
| 9 | 28675 | 5.1% |
| Other values (3) | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 561896 | |
| Space Separator | 2 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 155576 | |
| 2 | 137768 | |
| 3 | 45163 | 8.0% |
| 8 | 33226 | 5.9% |
| 0 | 33162 | 5.9% |
| 5 | 33152 | 5.9% |
| 6 | 32821 | 5.8% |
| 4 | 31502 | 5.6% |
| 7 | 30851 | 5.5% |
| 9 | 28675 | 5.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 561899 | |
| Latin | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 155576 | |
| 2 | 137768 | |
| 3 | 45163 | 8.0% |
| 8 | 33226 | 5.9% |
| 0 | 33162 | 5.9% |
| 5 | 33152 | 5.9% |
| 6 | 32821 | 5.8% |
| 4 | 31502 | 5.6% |
| 7 | 30851 | 5.5% |
| 9 | 28675 | 5.1% |
| Other values (2) | 3 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| W | 1 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 561900 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 155576 | |
| 2 | 137768 | |
| 3 | 45163 | 8.0% |
| 8 | 33226 | 5.9% |
| 0 | 33162 | 5.9% |
| 5 | 33152 | 5.9% |
| 6 | 32821 | 5.8% |
| 4 | 31502 | 5.6% |
| 7 | 30851 | 5.5% |
| 9 | 28675 | 5.1% |
| Other values (3) | 4 | < 0.1% |
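The month and day columns above are almost entirely one- or two-digit numbers, yet each reports a maximum length of 10 and a handful of stray characters (spaces, a period, one uppercase letter), which suggests a single non-numeric value in each column. A minimal sketch of a range check that would surface such values follows. The helper name `parse_int_in_range` is an illustrative assumption, and the stray value in the usage note is invented for demonstration only.

```python
import re
from typing import Optional

_DIGITS = re.compile(r"[0-9]+")


def parse_int_in_range(value: Optional[str], low: int, high: int) -> Optional[int]:
    """Return the integer if value is plain ASCII digits within [low, high], else None."""
    if value is None:
        return None
    text = value.strip()
    if not _DIGITS.fullmatch(text):
        return None  # free text, punctuation, or an empty string
    number = int(text)
    return number if low <= number <= high else None
```

For example, `parse_int_in_range("7", 1, 12)` gives `7`, while a made-up stray string such as `"70 21 9 W."` gives `None` and can be routed to manual review.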
verbatimEventDate
Text
Missing 
| Distinct | 67999 |
|---|---|
| Distinct (%) | 32.6% |
| Missing | 396366 |
| Missing (%) | 65.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 79 |
|---|---|
| Median length | 71 |
| Mean length | 10.59664321 |
| Min length | 1 |
Unique
| Unique | 51583 |
|---|---|
| Unique (%) | 24.8% |
Sample
| 1st row | [Not Stated] |
|---|---|
| 2nd row | 2-Aug-2005 |
| 3rd row | [Not Stated] |
| 4th row | [Not Stated] |
| 5th row | 9-IX-78 |
| Value | Count | Frequency (%) |
| not | 32203 | 8.2% |
| stated | 32171 | 8.2% |
| july | 8707 | 2.2% |
| aug | 7740 | 2.0% |
| june | 7233 | 1.8% |
| may | 5958 | 1.5% |
| 1968 | 5763 | 1.5% |
| 1971 | 5706 | 1.5% |
| 1966 | 4507 | 1.1% |
| 1972 | 2978 | 0.8% |
| Other values (37321) | 279788 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 217348 | 9.8% |
| (space) | 184400 | 8.4% |
| 9 | 146706 | 6.6% |
| - | 127710 | 5.8% |
| 2 | 112946 | 5.1% |
| t | 105546 | 4.8% |
| I | 88881 | 4.0% |
| 6 | 79326 | 3.6% |
| 0 | 76313 | 3.5% |
| . | 64867 | 2.9% |
| Other values (82) | 1003810 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 900919 | |
| Lowercase Letter | 464865 | |
| Uppercase Letter | 333421 | 15.1% |
| Space Separator | 184400 | 8.4% |
| Other Punctuation | 128799 | 5.8% |
| Dash Punctuation | 127746 | 5.8% |
| Open Punctuation | 33635 | 1.5% |
| Close Punctuation | 33630 | 1.5% |
| Connector Punctuation | 250 | < 0.1% |
| Math Symbol | 187 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 105546 | |
| e | 57954 | |
| a | 49169 | |
| u | 41267 | 8.9% |
| o | 39668 | 8.5% |
| d | 33270 | 7.2% |
| n | 19822 | 4.3% |
| y | 17901 | 3.9% |
| l | 17064 | 3.7% |
| r | 16877 | 3.6% |
| Other values (18) | 66327 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 88881 | |
| V | 43512 | |
| N | 38313 | |
| S | 36919 | |
| J | 33537 | 10.1% |
| A | 23445 | 7.0% |
| M | 13905 | 4.2% |
| X | 9131 | 2.7% |
| U | 7429 | 2.2% |
| E | 5310 | 1.6% |
| Other values (17) | 33039 | 9.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 64867 | |
| , | 34981 | |
| / | 23005 | 17.9% |
| ' | 5024 | 3.9% |
| : | 620 | 0.5% |
| ? | 141 | 0.1% |
| ; | 102 | 0.1% |
| & | 38 | < 0.1% |
| " | 9 | < 0.1% |
| # | 6 | < 0.1% |
| Other values (3) | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 217348 | |
| 9 | 146706 | |
| 2 | 112946 | |
| 6 | 79326 | 8.8% |
| 0 | 76313 | 8.5% |
| 7 | 63765 | 7.1% |
| 3 | 54211 | 6.0% |
| 8 | 53740 | 6.0% |
| 5 | 48644 | 5.4% |
| 4 | 47920 | 5.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 33547 | |
| ( | 82 | 0.2% |
| { | 6 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 33542 | |
| ) | 82 | 0.2% |
| } | 6 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 156 | |
| + | 26 | 13.9% |
| = | 5 | 2.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 127710 | |
| – | 36 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 184400 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 250 | 100.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1409567 | |
| Latin | 798286 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 105546 | |
| I | 88881 | 11.1% |
| e | 57954 | 7.3% |
| a | 49169 | 6.2% |
| V | 43512 | 5.5% |
| u | 41267 | 5.2% |
| o | 39668 | 5.0% |
| N | 38313 | 4.8% |
| S | 36919 | 4.6% |
| J | 33537 | 4.2% |
| Other values (45) | 263520 |
Common
| Value | Count | Frequency (%) |
| 1 | 217348 | |
| (space) | 184400 | |
| 9 | 146706 | |
| - | 127710 | |
| 2 | 112946 | |
| 6 | 79326 | 5.6% |
| 0 | 76313 | 5.4% |
| . | 64867 | 4.6% |
| 7 | 63765 | 4.5% |
| 3 | 54211 | 3.8% |
| Other values (27) | 281975 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2207812 | |
| Punctuation | 37 | < 0.1% |
| None | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 217348 | 9.8% |
| (space) | 184400 | 8.4% |
| 9 | 146706 | 6.6% |
| - | 127710 | 5.8% |
| 2 | 112946 | 5.1% |
| t | 105546 | 4.8% |
| I | 88881 | 4.0% |
| 6 | 79326 | 3.6% |
| 0 | 76313 | 3.5% |
| . | 64867 | 2.9% |
| Other values (77) | 1003769 |
Punctuation
| Value | Count | Frequency (%) |
| – | 36 | |
| … | 1 | 2.7% |
None
| Value | Count | Frequency (%) |
| û | 2 | |
| Ç | 1 | |
| ÿ | 1 |
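The sample value `9-IX-78` and the high counts of uppercase `I`, `V`, and `X` above suggest that many of these verbatim dates write the month as a Roman numeral. A minimal sketch of recognising that one pattern follows. The helper name `parse_roman_month_date` is an illustrative assumption, the accepted separators are a guess, and the year is returned as text because a two-digit year such as `78` does not identify the century.

```python
import re
from typing import Optional, Tuple

ROMAN_MONTHS = {
    "I": 1, "II": 2, "III": 3, "IV": 4, "V": 5, "VI": 6,
    "VII": 7, "VIII": 8, "IX": 9, "X": 10, "XI": 11, "XII": 12,
}

# day, Roman-numeral month, 2- or 4-digit year, separated by '-', '.', or space.
_PATTERN = re.compile(r"([0-9]{1,2})[-. ]([IVX]+)[-. ]([0-9]{2}|[0-9]{4})")


def parse_roman_month_date(text: str) -> Optional[Tuple[int, int, str]]:
    """Return (day, month, year_text) for values like '9-IX-78', else None."""
    match = _PATTERN.fullmatch(text.strip().upper())
    if match is None:
        return None
    month = ROMAN_MONTHS.get(match.group(2))
    if month is None:
        return None  # e.g. 'IIII' is not a month numeral
    return int(match.group(1)), month, match.group(3)
```

Values in other layouts, such as `2-Aug-2005` or the placeholder `[Not Stated]`, return `None` and are left for other handling.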
habitat
Text
Missing 
| Distinct | 89 |
|---|---|
| Distinct (%) | 44.7% |
| Missing | 604521 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 103 |
|---|---|
| Median length | 43 |
| Mean length | 19.28643216 |
| Min length | 5 |
Unique
| Unique | 64 |
|---|---|
| Unique (%) | 32.2% |
Sample
| 1st row | Roadside in coniferous forest |
|---|---|
| 2nd row | On a figleaf gourd |
| 3rd row | cultivated garden |
| 4th row | hammocks-dense hardwood & Palmetto forests |
| 5th row | visiting mango flowers |
| Value | Count | Frequency (%) |
| garden | 45 | 7.4% |
| cultivated | 44 | 7.3% |
| stream | 26 | 4.3% |
| on | 26 | 4.3% |
| forest | 23 | 3.8% |
| in | 19 | 3.1% |
| of | 13 | 2.1% |
| collected | 12 | 2.0% |
| at | 9 | 1.5% |
| terre | 8 | 1.3% |
| Other values (183) | 381 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 407 | 10.6% |
| e | 388 | 10.1% |
| a | 308 | 8.0% |
| r | 258 | 6.7% |
| t | 250 | 6.5% |
| d | 224 | 5.8% |
| n | 223 | 5.8% |
| o | 217 | 5.7% |
| i | 190 | 5.0% |
| l | 185 | 4.8% |
| Other values (52) | 1188 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3215 | |
| Space Separator | 407 | 10.6% |
| Uppercase Letter | 126 | 3.3% |
| Other Punctuation | 51 | 1.3% |
| Decimal Number | 27 | 0.7% |
| Dash Punctuation | 6 | 0.2% |
| Close Punctuation | 3 | 0.1% |
| Open Punctuation | 3 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 388 | |
| a | 308 | 9.6% |
| r | 258 | 8.0% |
| t | 250 | 7.8% |
| d | 224 | 7.0% |
| n | 223 | 6.9% |
| o | 217 | 6.7% |
| i | 190 | 5.9% |
| l | 185 | 5.8% |
| s | 175 | 5.4% |
| Other values (15) | 797 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 28 | |
| C | 24 | |
| R | 9 | 7.1% |
| O | 9 | 7.1% |
| P | 8 | 6.3% |
| T | 7 | 5.6% |
| I | 6 | 4.8% |
| W | 5 | 4.0% |
| F | 5 | 4.0% |
| E | 4 | 3.2% |
| Other values (10) | 21 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8 | |
| 2 | 6 | |
| 1 | 5 | |
| 3 | 4 | |
| 8 | 2 | 7.4% |
| 5 | 1 | 3.7% |
| 7 | 1 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 19 | |
| . | 16 | |
| " | 6 | 11.8% |
| : | 5 | 9.8% |
| & | 3 | 5.9% |
| / | 2 | 3.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 407 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3341 | |
| Common | 497 | 12.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 388 | |
| a | 308 | 9.2% |
| r | 258 | 7.7% |
| t | 250 | 7.5% |
| d | 224 | 6.7% |
| n | 223 | 6.7% |
| o | 217 | 6.5% |
| i | 190 | 5.7% |
| l | 185 | 5.5% |
| s | 175 | 5.2% |
| Other values (35) | 923 |
Common
| Value | Count | Frequency (%) |
| (space) | 407 | |
| , | 19 | 3.8% |
| . | 16 | 3.2% |
| 0 | 8 | 1.6% |
| " | 6 | 1.2% |
| 2 | 6 | 1.2% |
| - | 6 | 1.2% |
| 1 | 5 | 1.0% |
| : | 5 | 1.0% |
| 3 | 4 | 0.8% |
| Other values (7) | 15 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3838 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 407 | 10.6% |
| e | 388 | 10.1% |
| a | 308 | 8.0% |
| r | 258 | 6.7% |
| t | 250 | 6.5% |
| d | 224 | 5.8% |
| n | 223 | 5.8% |
| o | 217 | 5.7% |
| i | 190 | 5.0% |
| l | 185 | 4.8% |
| Other values (52) | 1188 |
locationID
Text
Missing 
| Distinct | 185 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 603675 |
| Missing (%) | 99.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 14 |
| Mean length | 10.78947368 |
| Min length | 1 |
Unique
| Unique | 94 |
|---|---|
| Unique (%) | 9.0% |
Sample
| 1st row | MEI Site 97-81 |
|---|---|
| 2nd row | RD-044 |
| 3rd row | MEI Site 97-81 |
| 4th row | MEI Site 97-81 |
| 5th row | MEI Site 97-81 |
| Value | Count | Frequency (%) |
| mei | 652 | |
| site | 610 | |
| 97-81 | 301 | |
| 97-92 | 132 | 5.6% |
| 97-90 | 52 | 2.2% |
| 97-58 | 46 | 1.9% |
| 97-74 | 31 | 1.3% |
| 97-88 | 26 | 1.1% |
| 97-93 | 24 | 1.0% |
| k-m1 | 19 | 0.8% |
| Other values (195) | 479 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1327 | 11.8% |
| - | 986 | 8.7% |
| 9 | 904 | 8.0% |
| 7 | 770 | 6.8% |
| M | 698 | 6.2% |
| I | 659 | 5.8% |
| E | 656 | 5.8% |
| t | 638 | 5.7% |
| e | 637 | 5.6% |
| i | 624 | 5.5% |
| Other values (46) | 3376 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3620 | |
| Uppercase Letter | 3287 | |
| Lowercase Letter | 2029 | |
| Space Separator | 1327 | 11.8% |
| Dash Punctuation | 986 | 8.7% |
| Other Punctuation | 26 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 698 | |
| I | 659 | |
| E | 656 | |
| S | 609 | |
| R | 278 | 8.5% |
| D | 272 | 8.3% |
| K | 20 | 0.6% |
| J | 14 | 0.4% |
| N | 11 | 0.3% |
| L | 11 | 0.3% |
| Other values (11) | 59 | 1.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 638 | |
| e | 637 | |
| i | 624 | |
| l | 27 | 1.3% |
| a | 20 | 1.0% |
| s | 20 | 1.0% |
| r | 10 | 0.5% |
| o | 8 | 0.4% |
| n | 7 | 0.3% |
| p | 7 | 0.3% |
| Other values (9) | 31 | 1.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 904 | |
| 7 | 770 | |
| 1 | 571 | |
| 8 | 458 | |
| 2 | 322 | 8.9% |
| 0 | 184 | 5.1% |
| 5 | 143 | 4.0% |
| 4 | 95 | 2.6% |
| 6 | 87 | 2.4% |
| 3 | 86 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 19 | |
| , | 5 | 19.2% |
| . | 1 | 3.8% |
| : | 1 | 3.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1327 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 986 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5959 | |
| Latin | 5316 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 698 | |
| I | 659 | |
| E | 656 | |
| t | 638 | |
| e | 637 | |
| i | 624 | |
| S | 609 | |
| R | 278 | 5.2% |
| D | 272 | 5.1% |
| l | 27 | 0.5% |
| Other values (30) | 218 | 4.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 1327 | |
| - | 986 | |
| 9 | 904 | |
| 7 | 770 | |
| 1 | 571 | |
| 8 | 458 | 7.7% |
| 2 | 322 | 5.4% |
| 0 | 184 | 3.1% |
| 5 | 143 | 2.4% |
| 4 | 95 | 1.6% |
| Other values (6) | 199 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11275 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1327 | 11.8% |
| - | 986 | 8.7% |
| 9 | 904 | 8.0% |
| 7 | 770 | 6.8% |
| M | 698 | 6.2% |
| I | 659 | 5.8% |
| E | 656 | 5.8% |
| t | 638 | 5.7% |
| e | 637 | 5.6% |
| i | 624 | 5.5% |
| Other values (46) | 3376 |
higherGeography
Text
Missing 
| Distinct | 10596 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 156093 |
| Missing (%) | 25.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 101 |
|---|---|
| Median length | 91 |
| Mean length | 30.38929222 |
| Min length | 4 |
Unique
| Unique | 3142 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | United States, [Not Stated], [Not Stated] |
|---|---|
| 2nd row | Costa Rica, Cartago, [Not Stated] |
| 3rd row | United States, Alaska, Aleutians West |
| 4th row | United States, Virginia, Virginia Beach |
| 5th row | United States, New York, [Not Stated] |
| Value | Count | Frequency (%) |
| united | 222849 | 12.1% |
| states | 221117 | 12.1% |
| not | 168021 | 9.2% |
| stated | 168019 | 9.2% |
| california | 23411 | 1.3% |
| virginia | 23321 | 1.3% |
| new | 22503 | 1.2% |
| colorado | 21080 | 1.1% |
| mexico | 21004 | 1.1% |
| canada | 16233 | 0.9% |
| Other values (6796) | 927210 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1386867 | 10.2% |
| t | 1386839 | 10.2% |
| (space) | 1386141 | 10.2% |
| e | 1091011 | 8.0% |
| i | 816099 | 6.0% |
| n | 814243 | 6.0% |
| , | 798935 | 5.9% |
| o | 692570 | 5.1% |
| d | 580440 | 4.3% |
| s | 501693 | 3.7% |
| Other values (122) | 4178619 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9268484 | |
| Uppercase Letter | 1826618 | 13.4% |
| Space Separator | 1386141 | 10.2% |
| Other Punctuation | 805778 | 5.9% |
| Open Punctuation | 168048 | 1.2% |
| Close Punctuation | 167999 | 1.2% |
| Dash Punctuation | 10310 | 0.1% |
| Decimal Number | 75 | < 0.1% |
| Modifier Letter | 2 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1386867 | |
| t | 1386839 | |
| e | 1091011 | |
| i | 816099 | |
| n | 814243 | |
| o | 692570 | |
| d | 580440 | |
| s | 501693 | 5.4% |
| r | 454316 | 4.9% |
| l | 313893 | 3.4% |
| Other values (59) | 1230513 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 462311 | |
| U | 242099 | |
| N | 220752 | |
| C | 174694 | 9.6% |
| M | 92438 | 5.1% |
| P | 64247 | 3.5% |
| B | 57602 | 3.2% |
| A | 54181 | 3.0% |
| T | 52091 | 2.9% |
| I | 45082 | 2.5% |
| Other values (27) | 361121 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 798935 | |
| ' | 3984 | 0.5% |
| . | 2433 | 0.3% |
| / | 183 | < 0.1% |
| ? | 152 | < 0.1% |
| & | 50 | < 0.1% |
| : | 39 | < 0.1% |
| ; | 1 | < 0.1% |
| ¡ | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 46 | |
| 9 | 14 | 18.7% |
| 4 | 11 | 14.7% |
| 2 | 2 | 2.7% |
| 8 | 1 | 1.3% |
| 1 | 1 | 1.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10286 | |
| – | 22 | 0.2% |
| — | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 168014 | |
| ( | 34 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 167965 | |
| ) | 34 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1386141 | 100.0% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 2 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 | 100.0% |
Control
| Value | Count | Frequency (%) |
| (control character) | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11095102 | |
| Common | 2538355 | 18.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1386867 | |
| t | 1386839 | |
| e | 1091011 | 9.8% |
| i | 816099 | 7.4% |
| n | 814243 | 7.3% |
| o | 692570 | 6.2% |
| d | 580440 | 5.2% |
| s | 501693 | 4.5% |
| S | 462311 | 4.2% |
| r | 454316 | 4.1% |
| Other values (96) | 2908713 |
Common
| Value | Count | Frequency (%) |
| (space) | 1386141 | |
| , | 798935 | |
| [ | 168014 | 6.6% |
| ] | 167965 | 6.6% |
| - | 10286 | 0.4% |
| ' | 3984 | 0.2% |
| . | 2433 | 0.1% |
| / | 183 | < 0.1% |
| ? | 152 | < 0.1% |
| & | 50 | < 0.1% |
| Other values (16) | 212 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13627164 | |
| None | 6245 | < 0.1% |
| Punctuation | 24 | < 0.1% |
| Latin Ext Additional | 22 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1386867 | 10.2% |
| t | 1386839 | 10.2% |
| (space) | 1386141 | 10.2% |
| e | 1091011 | 8.0% |
| i | 816099 | 6.0% |
| n | 814243 | 6.0% |
| , | 798935 | 5.9% |
| o | 692570 | 5.1% |
| d | 580440 | 4.3% |
| s | 501693 | 3.7% |
| Other values (63) | 4172326 |
None
| Value | Count | Frequency (%) |
| á | 1227 | |
| ü | 1114 | |
| í | 1027 | |
| ó | 731 | |
| é | 700 | |
| ã | 292 | 4.7% |
| ô | 268 | 4.3% |
| ø | 167 | 2.7% |
| è | 135 | 2.2% |
| ä | 68 | 1.1% |
| Other values (45) | 516 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ị | 22 | 100.0% |
Punctuation
| Value | Count | Frequency (%) |
| – | 22 | |
| — | 2 | 8.3% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 2 |
continent
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 604592 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 4 |
| Mean length | 7.15625 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | South America |
|---|---|
| 2nd row | Asia |
| 3rd row | South America |
| 4th row | Europe |
| 5th row | Asia |
| Value | Count | Frequency (%) |
| asia | 69 | |
| america | 40 | |
| north | 21 | 12.4% |
| south | 19 | 11.2% |
| europe | 9 | 5.3% |
| africa | 9 | 5.3% |
| not | 1 | 0.6% |
| stated | 1 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 119 | |
| A | 118 | |
| i | 118 | |
| r | 79 | |
| s | 69 | 7.5% |
| o | 50 | 5.5% |
| e | 50 | 5.5% |
| c | 49 | 5.3% |
| t | 43 | 4.7% |
| (space) | 41 | 4.5% |
| Other values (11) | 180 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 704 | |
| Uppercase Letter | 169 | 18.4% |
| Space Separator | 41 | 4.5% |
| Open Punctuation | 1 | 0.1% |
| Close Punctuation | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 119 | |
| i | 118 | |
| r | 79 | |
| s | 69 | |
| o | 50 | |
| e | 50 | |
| c | 49 | |
| t | 43 | 6.1% |
| m | 40 | 5.7% |
| h | 40 | 5.7% |
| Other values (4) | 47 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 118 | |
| N | 22 | 13.0% |
| S | 20 | 11.8% |
| E | 9 | 5.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 41 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 873 | |
| Common | 43 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 119 | |
| A | 118 | |
| i | 118 | |
| r | 79 | |
| s | 69 | |
| o | 50 | 5.7% |
| e | 50 | 5.7% |
| c | 49 | 5.6% |
| t | 43 | 4.9% |
| m | 40 | 4.6% |
| Other values (8) | 138 |
Common
| Value | Count | Frequency (%) |
| (space) | 41 | |
| [ | 1 | 2.3% |
| ] | 1 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 916 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 119 | |
| A | 118 | |
| i | 118 | |
| r | 79 | |
| s | 69 | 7.5% |
| o | 50 | 5.5% |
| e | 50 | 5.5% |
| c | 49 | 5.3% |
| t | 43 | 4.7% |
| (space) | 41 | 4.5% |
| Other values (11) | 180 |
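The category tables above group each character by its Unicode general category. As a minimal sketch, counts of this kind could be reproduced with Python's standard `unicodedata` module, assuming the column values are available as a list of strings. The function name `category_counts`, the name mapping, and the sample list are illustrative assumptions, not part of the report or of the profiling tool.

```python
import unicodedata
from collections import Counter

# Map two-letter Unicode general-category codes to the names used in the tables.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Lm": "Modifier Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
    "Sm": "Math Symbol",
    "Cc": "Control",
}

def category_counts(values):
    """Count characters per Unicode general category, skipping missing values."""
    counts = Counter()
    for value in values:
        if value is None:
            continue
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not listed above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Hypothetical sample: a few values of the kind shown in the Sample tables.
sample = ["South America", "Asia", None, "[Not Stated]"]
counts = category_counts(sample)
```

Dividing each count by `sum(counts.values())` gives a frequency column comparable to the one in the tables.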
waterBody
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | DeMarmels |
|---|
| Value | Count | Frequency (%) |
| demarmels | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2 | |
| D | 1 | |
| M | 1 | |
| a | 1 | |
| r | 1 | |
| m | 1 | |
| l | 1 | |
| s | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 2 | 22.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2 | |
| a | 1 | |
| r | 1 | |
| m | 1 | |
| l | 1 | |
| s | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2 | |
| D | 1 | |
| M | 1 | |
| a | 1 | |
| r | 1 | |
| m | 1 | |
| l | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2 | |
| D | 1 | |
| M | 1 | |
| a | 1 | |
| r | 1 | |
| m | 1 | |
| l | 1 | |
| s | 1 |
islandGroup
Text
Missing 
| Distinct | 72 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 602200 |
| Missing (%) | 99.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 13 |
| Mean length | 13.7202381 |
| Min length | 5 |
Unique
| Unique | 21 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Sunda Islands |
|---|---|
| 2nd row | Inner Islands |
| 3rd row | Viti Levu Group |
| 4th row | Chuuk Lagoon |
| 5th row | Sunda Islands |
| Value | Count | Frequency (%) |
| islands | 2160 | |
| sunda | 956 | |
| marquesas | 249 | 4.9% |
| solomon | 226 | 4.4% |
| bass | 171 | 3.3% |
| chuuk | 149 | 2.9% |
| lagoon | 149 | 2.9% |
| outer | 149 | 2.9% |
| inner | 140 | 2.7% |
| group | 100 | 2.0% |
| Other values (78) | 673 | 13.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 5365 | |
| a | 4395 | |
| n | 3948 | |
| d | 3266 | |
| (space) | 2602 | |
| l | 2568 | |
| I | 2313 | |
| u | 1953 | 5.6% |
| S | 1250 | 3.6% |
| o | 1226 | 3.5% |
| Other values (39) | 5689 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26832 | |
| Uppercase Letter | 5122 | 14.8% |
| Space Separator | 2602 | 7.5% |
| Other Punctuation | 19 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 5365 | |
| a | 4395 | |
| n | 3948 | |
| d | 3266 | |
| l | 2568 | |
| u | 1953 | 7.3% |
| o | 1226 | 4.6% |
| r | 905 | 3.4% |
| e | 893 | 3.3% |
| i | 343 | 1.3% |
| Other values (14) | 1970 | 7.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 2313 | |
| S | 1250 | |
| M | 256 | 5.0% |
| L | 237 | 4.6% |
| C | 200 | 3.9% |
| B | 171 | 3.3% |
| O | 158 | 3.1% |
| G | 147 | 2.9% |
| V | 87 | 1.7% |
| N | 75 | 1.5% |
| Other values (12) | 228 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 10 | |
| . | 9 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2602 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31954 | |
| Common | 2621 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 5365 | |
| a | 4395 | |
| n | 3948 | |
| d | 3266 | |
| l | 2568 | |
| I | 2313 | |
| u | 1953 | 6.1% |
| S | 1250 | 3.9% |
| o | 1226 | 3.8% |
| r | 905 | 2.8% |
| Other values (36) | 4765 |
Common
| Value | Count | Frequency (%) |
| (space) | 2602 | |
| ' | 10 | 0.4% |
| . | 9 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34575 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 5365 | |
| a | 4395 | |
| n | 3948 | |
| d | 3266 | |
| (space) | 2602 | |
| l | 2568 | |
| I | 2313 | |
| u | 1953 | 5.6% |
| S | 1250 | 3.6% |
| o | 1226 | 3.5% |
| Other values (39) | 5689 |
island
Text
Missing 
| Distinct | 436 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 595353 |
| Missing (%) | 98.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 21 |
| Mean length | 9.324436853 |
| Min length | 3 |
Unique
| Unique | 168 |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | South Island |
|---|---|
| 2nd row | Pohnpei |
| 3rd row | South Island |
| 4th row | Oahu |
| 5th row | Guadalcanal |
| Value | Count | Frequency (%) |
| island | 3167 | |
| south | 1636 | 11.1% |
| java | 884 | 6.0% |
| levu | 565 | 3.8% |
| viti | 541 | 3.7% |
| north | 519 | 3.5% |
| guadalcanal | 327 | 2.2% |
| borneo | 253 | 1.7% |
| hiva | 247 | 1.7% |
| key | 246 | 1.7% |
| Other values (438) | 6372 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12933 | |
| n | 6143 | 7.0% |
| l | 5485 | 6.3% |
| o | 5446 | 6.2% |
| (space) | 5390 | 6.2% |
| u | 4466 | 5.1% |
| d | 4450 | 5.1% |
| s | 4126 | 4.7% |
| e | 3908 | 4.5% |
| t | 3745 | 4.3% |
| Other values (52) | 31250 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 66998 | |
| Uppercase Letter | 14740 | 16.9% |
| Space Separator | 5390 | 6.2% |
| Other Punctuation | 169 | 0.2% |
| Dash Punctuation | 18 | < 0.1% |
| Open Punctuation | 13 | < 0.1% |
| Close Punctuation | 13 | < 0.1% |
| Modifier Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12933 | |
| n | 6143 | |
| l | 5485 | |
| o | 5446 | |
| u | 4466 | 6.7% |
| d | 4450 | 6.6% |
| s | 4126 | 6.2% |
| e | 3908 | 5.8% |
| t | 3745 | 5.6% |
| i | 3651 | 5.4% |
| Other values (19) | 12645 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 3295 | |
| S | 2358 | |
| N | 1067 | 7.2% |
| J | 892 | 6.1% |
| L | 820 | 5.6% |
| B | 722 | 4.9% |
| V | 681 | 4.6% |
| G | 648 | 4.4% |
| M | 648 | 4.4% |
| H | 619 | 4.2% |
| Other values (14) | 2990 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 164 | |
| . | 5 | 3.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 | |
| [ | 1 | 7.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 | |
| ] | 1 | 7.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5390 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 81738 | |
| Common | 5604 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12933 | |
| n | 6143 | 7.5% |
| l | 5485 | 6.7% |
| o | 5446 | 6.7% |
| u | 4466 | 5.5% |
| d | 4450 | 5.4% |
| s | 4126 | 5.0% |
| e | 3908 | 4.8% |
| t | 3745 | 4.6% |
| i | 3651 | 4.5% |
| Other values (43) | 27385 |
Common
| Value | Count | Frequency (%) |
| (space) | 5390 | |
| ' | 164 | 2.9% |
| - | 18 | 0.3% |
| ( | 12 | 0.2% |
| ) | 12 | 0.2% |
| . | 5 | 0.1% |
| ʻ | 1 | < 0.1% |
| [ | 1 | < 0.1% |
| ] | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 87316 | |
| None | 25 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 12933 | |
| n | 6143 | 7.0% |
| l | 5485 | 6.3% |
| o | 5446 | 6.2% |
| (space) | 5390 | 6.2% |
| u | 4466 | 5.1% |
| d | 4450 | 5.1% |
| s | 4126 | 4.7% |
| e | 3908 | 4.5% |
| t | 3745 | 4.3% |
| Other values (47) | 31224 |
None
| Value | Count | Frequency (%) |
| ñ | 13 | |
| ó | 7 | |
| é | 4 | 16.0% |
| Ž | 1 | 4.0% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
country
Text
Missing 
| Distinct | 361 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 156115 |
| Missing (%) | 25.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 57 |
|---|---|
| Median length | 44 |
| Mean length | 10.35667681 |
| Min length | 4 |
Unique
| Unique | 74 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | Costa Rica |
| 3rd row | United States |
| 4th row | United States |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 222629 | |
| states | 220899 | |
| canada | 16232 | 2.3% |
| mexico | 15811 | 2.2% |
| china | 14526 | 2.0% |
| brazil | 12973 | 1.8% |
| costa | 8910 | 1.2% |
| rica | 8910 | 1.2% |
| peru | 7637 | 1.1% |
| india | 7029 | 1.0% |
| Other values (376) | 184674 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 718259 | |
| e | 560772 | |
| a | 528526 | |
| i | 389761 | |
| n | 365385 | |
| d | 287382 | 6.2% |
| (space) | 271625 | 5.8% |
| s | 261111 | 5.6% |
| S | 244256 | 5.3% |
| U | 223931 | 4.8% |
| Other values (56) | 795049 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3644367 | |
| Uppercase Letter | 715786 | 15.4% |
| Space Separator | 271625 | 5.8% |
| Close Punctuation | 6527 | 0.1% |
| Open Punctuation | 6527 | 0.1% |
| Other Punctuation | 1214 | < 0.1% |
| Dash Punctuation | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 718259 | |
| e | 560772 | |
| a | 528526 | |
| i | 389761 | |
| n | 365385 | |
| d | 287382 | |
| s | 261111 | 7.2% |
| o | 84202 | 2.3% |
| r | 69612 | 1.9% |
| l | 68426 | 1.9% |
| Other values (18) | 310931 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 244256 | |
| U | 223931 | |
| C | 53206 | 7.4% |
| P | 26355 | 3.7% |
| M | 23603 | 3.3% |
| B | 19253 | 2.7% |
| I | 15375 | 2.1% |
| N | 14724 | 2.1% |
| R | 12978 | 1.8% |
| G | 12741 | 1.8% |
| Other values (15) | 69364 | 9.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 945 | |
| . | 111 | 9.1% |
| ' | 105 | 8.6% |
| : | 36 | 3.0% |
| ? | 10 | 0.8% |
| / | 6 | 0.5% |
| ; | 1 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 6524 | |
| ) | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 6524 | |
| ( | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 271625 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4360153 | |
| Common | 285904 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 718259 | |
| e | 560772 | |
| a | 528526 | |
| i | 389761 | |
| n | 365385 | |
| d | 287382 | |
| s | 261111 | 6.0% |
| S | 244256 | 5.6% |
| U | 223931 | 5.1% |
| o | 84202 | 1.9% |
| Other values (43) | 696568 |
Common
| Value | Count | Frequency (%) |
| (space) | 271625 | |
| ] | 6524 | 2.3% |
| [ | 6524 | 2.3% |
| , | 945 | 0.3% |
| . | 111 | < 0.1% |
| ' | 105 | < 0.1% |
| : | 36 | < 0.1% |
| - | 11 | < 0.1% |
| ? | 10 | < 0.1% |
| / | 6 | < 0.1% |
| Other values (3) | 7 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4645987 | |
| None | 70 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 718259 | |
| e | 560772 | |
| a | 528526 | |
| i | 389761 | |
| n | 365385 | |
| d | 287382 | 6.2% |
| (space) | 271625 | 5.8% |
| s | 261111 | 5.6% |
| S | 244256 | 5.3% |
| U | 223931 | 4.8% |
| Other values (54) | 794979 |
None
| Value | Count | Frequency (%) |
| ô | 69 | |
| ç | 1 | 1.4% |
stateProvince
Text
Missing 
| Distinct | 3068 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 173239 |
| Missing (%) | 28.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 57 |
|---|---|
| Median length | 44 |
| Mean length | 9.044942883 |
| Min length | 2 |
Unique
| Unique | 808 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | [Not Stated] |
|---|---|
| 2nd row | Cartago |
| 3rd row | Alaska |
| 4th row | Virginia |
| 5th row | New York |
| Value | Count | Frequency (%) |
| not | 29440 | 5.2% |
| stated | 29440 | 5.2% |
| california | 23322 | 4.1% |
| virginia | 22013 | 3.9% |
| colorado | 20952 | 3.7% |
| new | 16651 | 2.9% |
| texas | 12341 | 2.2% |
| arizona | 12146 | 2.1% |
| florida | 9884 | 1.7% |
| maryland | 9608 | 1.7% |
| Other values (2915) | 379877 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 524357 | 13.4% |
| o | 333196 | 8.5% |
| i | 321786 | 8.2% |
| n | 299093 | 7.7% |
| r | 250082 | 6.4% |
| e | 216703 | 5.6% |
| t | 208658 | 5.3% |
| s | 151919 | 3.9% |
| l | 138292 | 3.5% |
| (space) | 134193 | 3.4% |
| Other values (106) | 1324442 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3135689 | |
| Uppercase Letter | 563884 | 14.4% |
| Space Separator | 134193 | 3.4% |
| Open Punctuation | 29409 | 0.8% |
| Close Punctuation | 29400 | 0.8% |
| Dash Punctuation | 8111 | 0.2% |
| Other Punctuation | 1958 | 0.1% |
| Decimal Number | 75 | < 0.1% |
| Control | 1 | < 0.1% |
| Modifier Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 524357 | |
| o | 333196 | |
| i | 321786 | |
| n | 299093 | |
| r | 250082 | |
| e | 216703 | 6.9% |
| t | 208658 | 6.7% |
| s | 151919 | 4.8% |
| l | 138292 | 4.4% |
| d | 113012 | 3.6% |
| Other values (49) | 578591 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 79656 | |
| N | 67159 | |
| S | 61382 | |
| M | 46174 | 8.2% |
| T | 31302 | 5.6% |
| A | 30288 | 5.4% |
| V | 29057 | 5.2% |
| W | 27116 | 4.8% |
| P | 20337 | 3.6% |
| I | 18302 | 3.2% |
| Other values (25) | 153111 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 987 | |
| ' | 638 | |
| ? | 138 | 7.0% |
| / | 121 | 6.2% |
| , | 70 | 3.6% |
| : | 3 | 0.2% |
| ¡ | 1 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 46 | |
| 9 | 14 | 18.7% |
| 4 | 11 | 14.7% |
| 2 | 2 | 2.7% |
| 8 | 1 | 1.3% |
| 1 | 1 | 1.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 29408 | |
| ( | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 29399 | |
| ) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8089 | |
| – | 22 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 134193 |
Control
| Value | Count | Frequency (%) |
| (control character) | 1 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3699573 | |
| Common | 203148 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 524357 | |
| o | 333196 | 9.0% |
| i | 321786 | 8.7% |
| n | 299093 | 8.1% |
| r | 250082 | 6.8% |
| e | 216703 | 5.9% |
| t | 208658 | 5.6% |
| s | 151919 | 4.1% |
| l | 138292 | 3.7% |
| d | 113012 | 3.1% |
| Other values (84) | 1142475 |
Common
| Value | Count | Frequency (%) |
| (space) | 134193 | |
| [ | 29408 | 14.5% |
| ] | 29399 | 14.5% |
| - | 8089 | 4.0% |
| . | 987 | 0.5% |
| ' | 638 | 0.3% |
| ? | 138 | 0.1% |
| / | 121 | 0.1% |
| , | 70 | < 0.1% |
| 3 | 46 | < 0.1% |
| Other values (12) | 59 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3897577 | |
| None | 5099 | 0.1% |
| Latin Ext Additional | 22 | < 0.1% |
| Punctuation | 22 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 524357 | 13.5% |
| o | 333196 | 8.5% |
| i | 321786 | 8.3% |
| n | 299093 | 7.7% |
| r | 250082 | 6.4% |
| e | 216703 | 5.6% |
| t | 208658 | 5.4% |
| s | 151919 | 3.9% |
| l | 138292 | 3.5% |
| (space) | 134193 | 3.4% |
| Other values (60) | 1319298 |
None
| Value | Count | Frequency (%) |
| á | 1200 | |
| ü | 991 | |
| í | 928 | |
| ó | 488 | |
| é | 410 | 8.0% |
| ã | 292 | 5.7% |
| ø | 158 | 3.1% |
| ô | 125 | 2.5% |
| è | 117 | 2.3% |
| ä | 54 | 1.1% |
| Other values (33) | 336 | 6.6% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ị | 22 |
Punctuation
| Value | Count | Frequency (%) |
| – | 22 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
county
Text
Missing 
| Distinct | 4068 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 254867 |
| Missing (%) | 42.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 51 |
|---|---|
| Median length | 45 |
| Mean length | 9.456280209 |
| Min length | 1 |
Unique
| Unique | 1157 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | [Not Stated] |
|---|---|
| 2nd row | [Not Stated] |
| 3rd row | Aleutians West |
| 4th row | Virginia Beach |
| 5th row | [Not Stated] |
| Value | Count | Frequency (%) |
| not | 132062 | |
| stated | 132060 | |
| boulder | 6789 | 1.3% |
| creek | 6760 | 1.3% |
| clear | 6751 | 1.3% |
| san | 5405 | 1.0% |
| montgomery | 4939 | 0.9% |
| cochise | 4320 | 0.8% |
| prince | 3492 | 0.7% |
| tuolumne | 3206 | 0.6% |
| Other values (4079) | 215282 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 455467 | |
| a | 309900 | 9.4% |
| e | 305731 | 9.2% |
| o | 264738 | 8.0% |
| (space) | 171213 | 5.2% |
| d | 169224 | 5.1% |
| S | 152130 | 4.6% |
| N | 137690 | 4.2% |
| n | 133853 | 4.0% |
| [ | 132080 | 4.0% |
| Other values (88) | 1076282 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2347003 | |
| Uppercase Letter | 519166 | 15.7% |
| Space Separator | 171213 | 5.2% |
| Open Punctuation | 132098 | 4.0% |
| Close Punctuation | 132058 | 4.0% |
| Other Punctuation | 4601 | 0.1% |
| Dash Punctuation | 2168 | 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 455467 | |
| a | 309900 | |
| e | 305731 | |
| o | 264738 | |
| d | 169224 | 7.2% |
| n | 133853 | 5.7% |
| r | 128735 | 5.5% |
| i | 96642 | 4.1% |
| l | 92695 | 3.9% |
| s | 72035 | 3.1% |
| Other values (42) | 317983 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 152130 | |
| N | 137690 | |
| C | 39707 | 7.6% |
| B | 24571 | 4.7% |
| M | 21494 | 4.1% |
| P | 16767 | 3.2% |
| W | 13595 | 2.6% |
| L | 12294 | 2.4% |
| G | 12068 | 2.3% |
| T | 10765 | 2.1% |
| Other values (23) | 78085 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 3065 | |
| . | 1321 | |
| , | 105 | 2.3% |
| / | 56 | 1.2% |
| & | 50 | 1.1% |
| ? | 4 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 132080 | |
| ( | 18 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 132040 | |
| ) | 18 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 171213 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2168 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2866169 | |
| Common | 442139 | 13.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 455467 | |
| a | 309900 | |
| e | 305731 | |
| o | 264738 | 9.2% |
| d | 169224 | 5.9% |
| S | 152130 | 5.3% |
| N | 137690 | 4.8% |
| n | 133853 | 4.7% |
| r | 128735 | 4.5% |
| i | 96642 | 3.4% |
| Other values (75) | 712059 |
Common
| Value | Count | Frequency (%) |
| (space) | 171213 | |
| [ | 132080 | |
| ] | 132040 | |
| ' | 3065 | 0.7% |
| - | 2168 | 0.5% |
| . | 1321 | 0.3% |
| , | 105 | < 0.1% |
| / | 56 | < 0.1% |
| & | 50 | < 0.1% |
| ( | 18 | < 0.1% |
| Other values (3) | 23 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3307261 | |
| None | 1047 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 455467 | |
| a | 309900 | 9.4% |
| e | 305731 | 9.2% |
| o | 264738 | 8.0% |
| (space) | 171213 | 5.2% |
| d | 169224 | 5.1% |
| S | 152130 | 4.6% |
| N | 137690 | 4.2% |
| n | 133853 | 4.0% |
| [ | 132080 | 4.0% |
| Other values (55) | 1075235 |
None
| Value | Count | Frequency (%) |
| é | 285 | |
| ó | 235 | |
| ü | 123 | |
| í | 99 | 9.5% |
| ô | 74 | 7.1% |
| Ñ | 29 | 2.8% |
| á | 27 | 2.6% |
| è | 18 | 1.7% |
| ś | 16 | 1.5% |
| ć | 15 | 1.4% |
| Other values (23) | 126 |
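The stateProvince and county sections above show `[Not Stated]` among the most frequent values, and the report counts it as a present value rather than as missing. A minimal sketch of treating that placeholder as missing before computing a missing-value rate follows, assuming the values are plain Python strings. The helper names `normalize_placeholder` and `missing_rate` and the placeholder set are hypothetical and not part of the dataset or the profiling tool.

```python
# Placeholder spellings to treat as missing, compared case-insensitively.
PLACEHOLDERS = {"[not stated]"}

def normalize_placeholder(value):
    """Return None for missing, empty, or placeholder values, else the stripped string."""
    if value is None:
        return None
    text = value.strip()
    if not text or text.casefold() in PLACEHOLDERS:
        return None
    return text

def missing_rate(values):
    """Fraction of values that are missing after placeholder normalization."""
    if not values:
        return 0.0
    cleaned = [normalize_placeholder(v) for v in values]
    return sum(v is None for v in cleaned) / len(cleaned)

# The five county sample rows shown above.
sample = ["[Not Stated]", "[Not Stated]", "Aleutians West", "Virginia Beach", "[Not Stated]"]
rate = missing_rate(sample)
```

Applied to a whole column, this would raise the effective missing percentage above the figure in the summary table, since placeholder rows are currently counted as populated.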
locality
Text
Missing 
| Distinct | 76621 |
|---|---|
| Distinct (%) | 17.2% |
| Missing | 158363 |
| Missing (%) | 26.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 550043 |
|---|---|
| Median length | 182 |
| Mean length | 24.13015367 |
| Min length | 1 |
Unique
| Unique | 44463 |
|---|---|
| Unique (%) | 10.0% |
Sample
| 1st row | [Not Stated] |
|---|---|
| 2nd row | Rio Aquiares, Turrialba |
| 3rd row | Saint Paul Island, Bering Sea |
| 4th row | False Cape State Park, Wash Woods, 100 meters east of Interpreter's residence |
| 5th row | [Not Stated] |
| Value | Count | Frequency (%) |
| not | 66601 | 4.1% |
| stated | 66524 | 4.1% |
| of | 42103 | 2.6% |
| miles | 21225 | 1.3% |
| kilometers | 15789 | 1.0% |
| park | 15479 | 1.0% |
| river | 15374 | 1.0% |
| lake | 14864 | 0.9% |
| near | 12865 | 0.8% |
| creek | 12692 | 0.8% |
| Other values (59148) | 1327951 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1113923 | 10.3% |
| a | 970976 | 9.0% |
| e | 784777 | 7.3% |
| o | 677654 | 6.3% |
| t | 644226 | 6.0% |
| n | 525770 | 4.9% |
| i | 505577 | 4.7% |
| r | 496321 | 4.6% |
| l | 397845 | 3.7% |
| s | 367559 | 3.4% |
| Other values (138) | 4286035 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7196462 | |
| Uppercase Letter | 1266334 | 11.8% |
| Space Separator | 1113923 | 10.3% |
| Decimal Number | 367182 | 3.4% |
| Other Punctuation | 344983 | 3.2% |
| Control | 288676 | 2.7% |
| Open Punctuation | 78963 | 0.7% |
| Close Punctuation | 78951 | 0.7% |
| Dash Punctuation | 33528 | 0.3% |
| Math Symbol | 1306 | < 0.1% |
| Other values (6) | 355 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 970976 | |
| e | 784777 | |
| o | 677654 | |
| t | 644226 | |
| n | 525770 | 7.3% |
| i | 505577 | 7.0% |
| r | 496321 | 6.9% |
| l | 397845 | 5.5% |
| s | 367559 | 5.1% |
| u | 258028 | 3.6% |
| Other values (48) | 1567729 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 174256 | |
| N | 124539 | 9.8% |
| C | 121685 | 9.6% |
| P | 93135 | 7.4% |
| R | 84162 | 6.6% |
| M | 83089 | 6.6% |
| B | 66969 | 5.3% |
| L | 58035 | 4.6% |
| A | 52250 | 4.1% |
| F | 47325 | 3.7% |
| Other values (30) | 360889 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 157009 | |
| . | 82199 | |
| ; | 57201 | 16.6% |
| : | 21295 | 6.2% |
| / | 12921 | 3.7% |
| ' | 9409 | 2.7% |
| ? | 1932 | 0.6% |
| " | 1379 | 0.4% |
| & | 936 | 0.3% |
| # | 665 | 0.2% |
| Other values (5) | 37 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 65034 | |
| 1 | 61769 | |
| 2 | 43526 | |
| 3 | 35033 | |
| 5 | 34550 | |
| 6 | 29303 | |
| 4 | 27484 | |
| 8 | 23936 | 6.5% |
| 9 | 23798 | 6.5% |
| 7 | 22749 | 6.2% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 649 | |
| + | 302 | |
| ~ | 250 | 19.1% |
| \| | 102 | 7.8% |
| < | 2 | 0.2% |
| > | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 70524 | |
| ( | 8335 | 10.6% |
| { | 103 | 0.1% |
| ‚ | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 287154 | |
| (control character) | 1520 | 0.5% |
| (control character) | 2 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 70480 | |
| ) | 8328 | 10.5% |
| } | 143 | 0.2% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 3 | |
| ¯ | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1113923 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 33528 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 134 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 134 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 50 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 26 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8462796 | |
| Common | 2307867 | 21.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 970976 | 11.5% |
| e | 784777 | 9.3% |
| o | 677654 | 8.0% |
| t | 644226 | 7.6% |
| n | 525770 | 6.2% |
| i | 505577 | 6.0% |
| r | 496321 | 5.9% |
| l | 397845 | 4.7% |
| s | 367559 | 4.3% |
| u | 258028 | 3.0% |
| Other values (88) | 2834063 |
Common
| Value | Count | Frequency (%) |
| (space) | 1113923 | |
| (control character) | 287154 | 12.4% |
| , | 157009 | 6.8% |
| . | 82199 | 3.6% |
| [ | 70524 | 3.1% |
| ] | 70480 | 3.1% |
| 0 | 65034 | 2.8% |
| 1 | 61769 | 2.7% |
| ; | 57201 | 2.5% |
| 2 | 43526 | 1.9% |
| Other values (40) | 299048 | 13.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10768129 | |
| None | 2500 | < 0.1% |
| Punctuation | 34 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1113923 | 10.3% |
| a | 970976 | 9.0% |
| e | 784777 | 7.3% |
| o | 677654 | 6.3% |
| t | 644226 | 6.0% |
| n | 525770 | 4.9% |
| i | 505577 | 4.7% |
| r | 496321 | 4.6% |
| l | 397845 | 3.7% |
| s | 367559 | 3.4% |
| Other values (82) | 4283501 |
None
| Value | Count | Frequency (%) |
| ñ | 374 | |
| ó | 346 | |
| á | 338 | |
| é | 321 | |
| ã | 219 | |
| ü | 178 | |
| í | 138 | 5.5% |
| ° | 134 | 5.4% |
| ç | 115 | 4.6% |
| ¢ | 50 | 2.0% |
| Other values (42) | 287 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 26 | |
| “ | 6 | 17.6% |
| … | 1 | 2.9% |
| ‚ | 1 | 2.9% |
Missing 
| Distinct | 1812 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 558058 |
| Missing (%) | 92.3% |
| Memory size | 4.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 5.369958424 |
| Min length | 3 |
Unique
| Unique | 454 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 2040.0 |
|---|---|
| 2nd row | 240.0 |
| 3rd row | 165.0 |
| 4th row | 400.0 |
| 5th row | 1300.0 |
| Value | Count | Frequency (%) |
| 2743.0 | 1183 | 2.5% |
| 3353.0 | 909 | 1.9% |
| 1829.0 | 812 | 1.7% |
| 610.0 | 652 | 1.4% |
| 1524.0 | 627 | 1.3% |
| 914.0 | 612 | 1.3% |
| 427.0 | 567 | 1.2% |
| 1100.0 | 562 | 1.2% |
| 200.0 | 531 | 1.1% |
| 1372.0 | 519 | 1.1% |
| Other values (1798) | 39688 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 75959 | |
| . | 46662 | |
| 1 | 25391 | 10.1% |
| 2 | 21165 | 8.4% |
| 3 | 15751 | 6.3% |
| 5 | 14062 | 5.6% |
| 4 | 13695 | 5.5% |
| 7 | 11236 | 4.5% |
| 9 | 9362 | 3.7% |
| 6 | 9353 | 3.7% |
| Other values (2) | 7937 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 203889 | |
| Other Punctuation | 46662 | 18.6% |
| Dash Punctuation | 22 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 75959 | |
| 1 | 25391 | 12.5% |
| 2 | 21165 | 10.4% |
| 3 | 15751 | 7.7% |
| 5 | 14062 | 6.9% |
| 4 | 13695 | 6.7% |
| 7 | 11236 | 5.5% |
| 9 | 9362 | 4.6% |
| 6 | 9353 | 4.6% |
| 8 | 7915 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 46662 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 22 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 250573 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 75959 | |
| . | 46662 | |
| 1 | 25391 | 10.1% |
| 2 | 21165 | 8.4% |
| 3 | 15751 | 6.3% |
| 5 | 14062 | 5.6% |
| 4 | 13695 | 5.5% |
| 7 | 11236 | 4.5% |
| 9 | 9362 | 3.7% |
| 6 | 9353 | 3.7% |
| Other values (2) | 7937 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 250573 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 75959 | |
| . | 46662 | |
| 1 | 25391 | 10.1% |
| 2 | 21165 | 8.4% |
| 3 | 15751 | 6.3% |
| 5 | 14062 | 5.6% |
| 4 | 13695 | 5.5% |
| 7 | 11236 | 4.5% |
| 9 | 9362 | 3.7% |
| 6 | 9353 | 3.7% |
| Other values (2) | 7937 | 3.2% |
Missing 
| Distinct | 1534 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 573266 |
| Missing (%) | 94.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.472658485 |
| Min length | 3 |
Unique
| Unique | 401 |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | 2040.0 |
|---|---|
| 2nd row | 240.0 |
| 3rd row | 165.0 |
| 4th row | 400.0 |
| 5th row | 1300.0 |
| Value | Count | Frequency (%) |
| 3353.0 | 850 | 2.7% |
| 2438.0 | 719 | 2.3% |
| 1829.0 | 717 | 2.3% |
| 1524.0 | 582 | 1.9% |
| 2743.0 | 553 | 1.8% |
| 427.0 | 467 | 1.5% |
| 1200.0 | 465 | 1.5% |
| 1372.0 | 453 | 1.4% |
| 2134.0 | 424 | 1.3% |
| 2499.0 | 416 | 1.3% |
| Other values (1523) | 25808 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 51748 | |
| . | 31454 | |
| 1 | 16786 | 9.8% |
| 2 | 15345 | 8.9% |
| 3 | 10880 | 6.3% |
| 4 | 9719 | 5.6% |
| 5 | 9555 | 5.6% |
| 7 | 7998 | 4.6% |
| 9 | 6255 | 3.6% |
| 8 | 6241 | 3.6% |
| Other values (2) | 6156 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 140671 | |
| Other Punctuation | 31454 | 18.3% |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 51748 | |
| 1 | 16786 | 11.9% |
| 2 | 15345 | 10.9% |
| 3 | 10880 | 7.7% |
| 4 | 9719 | 6.9% |
| 5 | 9555 | 6.8% |
| 7 | 7998 | 5.7% |
| 9 | 6255 | 4.4% |
| 8 | 6241 | 4.4% |
| 6 | 6144 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 31454 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 172137 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 51748 | |
| . | 31454 | |
| 1 | 16786 | 9.8% |
| 2 | 15345 | 8.9% |
| 3 | 10880 | 6.3% |
| 4 | 9719 | 5.6% |
| 5 | 9555 | 5.6% |
| 7 | 7998 | 4.6% |
| 9 | 6255 | 3.6% |
| 8 | 6241 | 3.6% |
| Other values (2) | 6156 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 172137 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 51748 | |
| . | 31454 | |
| 1 | 16786 | 9.8% |
| 2 | 15345 | 8.9% |
| 3 | 10880 | 6.3% |
| 4 | 9719 | 5.6% |
| 5 | 9555 | 5.6% |
| 7 | 7998 | 4.6% |
| 9 | 6255 | 3.6% |
| 8 | 6241 | 3.6% |
| Other values (2) | 6156 | 3.6% |
Missing 
| Distinct | 1024 |
|---|---|
| Distinct (%) | 10.3% |
| Missing | 594785 |
| Missing (%) | 98.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 94 |
|---|---|
| Median length | 31 |
| Mean length | 8.088173125 |
| Min length | 1 |
Unique
| Unique | 334 |
|---|---|
| Unique (%) | 3.4% |
Sample
| 1st row | 140 meters |
|---|---|
| 2nd row | 3900 feet |
| 3rd row | 5940 feet |
| 4th row | 180 meters |
| 5th row | 3000 feet |
| Value | Count | Frequency (%) |
| m | 2783 | 14.5% |
| feet | 2472 | 12.9% |
| meters | 1521 | 7.9% |
| ft | 1465 | 7.6% |
| 1000 | 347 | 1.8% |
| level | 318 | 1.7% |
| sea | 318 | 1.7% |
| 300 | 305 | 1.6% |
| near | 276 | 1.4% |
| 3200 | 236 | 1.2% |
| Other values (619) | 9193 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 16890 | |
| e | 9358 | |
| (space) | 9299 | |
| t | 5738 | 7.1% |
| m | 5103 | 6.4% |
| f | 4103 | 5.1% |
| 1 | 4089 | 5.1% |
| 5 | 3791 | 4.7% |
| 2 | 2913 | 3.6% |
| . | 2459 | 3.1% |
| Other values (44) | 16613 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 36344 | |
| Lowercase Letter | 30894 | |
| Space Separator | 9299 | 11.6% |
| Other Punctuation | 2946 | 3.7% |
| Dash Punctuation | 765 | 1.0% |
| Uppercase Letter | 44 | 0.1% |
| Open Punctuation | 23 | < 0.1% |
| Close Punctuation | 23 | < 0.1% |
| Math Symbol | 18 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9358 | |
| t | 5738 | |
| m | 5103 | |
| f | 4103 | |
| r | 1891 | 6.1% |
| s | 1851 | 6.0% |
| a | 854 | 2.8% |
| l | 695 | 2.2% |
| n | 346 | 1.1% |
| v | 331 | 1.1% |
| Other values (12) | 624 | 2.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 16890 | |
| 1 | 4089 | 11.3% |
| 5 | 3791 | 10.4% |
| 2 | 2913 | 8.0% |
| 3 | 2121 | 5.8% |
| 4 | 1908 | 5.2% |
| 6 | 1282 | 3.5% |
| 7 | 1249 | 3.4% |
| 8 | 1174 | 3.2% |
| 9 | 927 | 2.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 30 | |
| N | 5 | 11.4% |
| L | 3 | 6.8% |
| A | 2 | 4.5% |
| P | 1 | 2.3% |
| B | 1 | 2.3% |
| S | 1 | 2.3% |
| W | 1 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2459 | |
| ' | 338 | 11.5% |
| , | 126 | 4.3% |
| & | 13 | 0.4% |
| ? | 9 | 0.3% |
| / | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 22 | |
| [ | 1 | 4.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 22 | |
| ] | 1 | 4.3% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 17 | |
| + | 1 | 5.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9299 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 765 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49418 | |
| Latin | 30938 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9358 | |
| t | 5738 | |
| m | 5103 | |
| f | 4103 | |
| r | 1891 | 6.1% |
| s | 1851 | 6.0% |
| a | 854 | 2.8% |
| l | 695 | 2.2% |
| n | 346 | 1.1% |
| v | 331 | 1.1% |
| Other values (20) | 668 | 2.2% |
Common
| Value | Count | Frequency (%) |
| 0 | 16890 | |
| (space) | 9299 | |
| 1 | 4089 | 8.3% |
| 5 | 3791 | 7.7% |
| 2 | 2913 | 5.9% |
| . | 2459 | 5.0% |
| 3 | 2121 | 4.3% |
| 4 | 1908 | 3.9% |
| 6 | 1282 | 2.6% |
| 7 | 1249 | 2.5% |
| Other values (14) | 3417 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 80356 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 16890 | |
| e | 9358 | |
| (space) | 9299 | |
| t | 5738 | 7.1% |
| m | 5103 | 6.4% |
| f | 4103 | 5.1% |
| 1 | 4089 | 5.1% |
| 5 | 3791 | 4.7% |
| 2 | 2913 | 3.6% |
| . | 2459 | 3.1% |
| Other values (44) | 16613 |
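
The sample rows and word counts above mix metric and imperial units (`140 meters`, `3900 feet`, and the tokens `m`, `feet`, `meters`, `ft`). The following is a minimal sketch of how such strings might be normalized to metres, assuming the simple form "number, then unit". The function name `elevation_to_meters` and the unit table are illustrative assumptions, not part of the dataset or of the profiling tool. Anything that does not fit the pattern (ranges, `near sea level`, a trailing `'` used for feet) is returned as `None` rather than guessed.

```python
import re
from typing import Optional

FEET_TO_METERS = 0.3048  # exact, by the international definition of the foot

# Unit spellings taken from the word table above (assumed list, not exhaustive).
_UNITS = {"m": 1.0, "meters": 1.0, "ft": FEET_TO_METERS, "feet": FEET_TO_METERS}

# "<number> <unit>" with an optional trailing period, e.g. "3900 feet" or "300 ft."
_PATTERN = re.compile(r"^\s*(\d+(?:\.\d+)?)\s*([A-Za-z]+)\.?\s*$")


def elevation_to_meters(text: str) -> Optional[float]:
    """Hypothetical helper: convert a simple elevation string to metres."""
    match = _PATTERN.match(text)
    if match is None:
        return None  # free text, ranges, or other unsupported forms
    value, unit = match.groups()
    factor = _UNITS.get(unit.lower())
    if factor is None:
        return None  # unit not in the assumed table
    return float(value) * factor
```

Returning `None` for unparsed values keeps them visible for manual review instead of silently coercing them.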
Missing 
| Distinct | 13 |
|---|---|
| Distinct (%) | 37.1% |
| Missing | 604685 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 5 |
| Mean length | 5.114285714 |
| Min length | 3 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | 20.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 250.0 |
| 3rd row | 0.0 |
| 4th row | Argia orichalcea |
| 5th row | 370.0 |
| Value | Count | Frequency (%) |
| 250.0 | 9 | |
| 0.0 | 6 | |
| 880.0 | 6 | |
| 370.0 | 3 | 8.3% |
| 1707.0 | 2 | 5.6% |
| 775.0 | 2 | 5.6% |
| argia | 1 | 2.8% |
| orichalcea | 1 | 2.8% |
| 359.0 | 1 | 2.8% |
| 1400.0 | 1 | 2.8% |
| Other values (4) | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 68 | |
| . | 34 | |
| 5 | 13 | 7.3% |
| 7 | 13 | 7.3% |
| 8 | 12 | 6.7% |
| 2 | 9 | 5.0% |
| 3 | 6 | 3.4% |
| 1 | 4 | 2.2% |
| a | 3 | 1.7% |
| 4 | 2 | 1.1% |
| Other values (12) | 15 | 8.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 129 | |
| Other Punctuation | 34 | 19.0% |
| Lowercase Letter | 14 | 7.8% |
| Space Separator | 1 | 0.6% |
| Uppercase Letter | 1 | 0.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 68 | |
| 5 | 13 | 10.1% |
| 7 | 13 | 10.1% |
| 8 | 12 | 9.3% |
| 2 | 9 | 7.0% |
| 3 | 6 | 4.7% |
| 1 | 4 | 3.1% |
| 4 | 2 | 1.6% |
| 9 | 1 | 0.8% |
| 6 | 1 | 0.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| c | 2 | |
| i | 2 | |
| r | 2 | |
| g | 1 | 7.1% |
| o | 1 | 7.1% |
| h | 1 | 7.1% |
| l | 1 | 7.1% |
| e | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 34 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 164 | |
| Latin | 15 | 8.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 68 | |
| . | 34 | |
| 5 | 13 | 7.9% |
| 7 | 13 | 7.9% |
| 8 | 12 | 7.3% |
| 2 | 9 | 5.5% |
| 3 | 6 | 3.7% |
| 1 | 4 | 2.4% |
| 4 | 2 | 1.2% |
| (space) | 1 | 0.6% |
| Other values (2) | 2 | 1.2% |
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| c | 2 | |
| i | 2 | |
| r | 2 | |
| g | 1 | 6.7% |
| o | 1 | 6.7% |
| h | 1 | 6.7% |
| l | 1 | 6.7% |
| e | 1 | 6.7% |
| A | 1 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 179 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 68 | |
| . | 34 | |
| 5 | 13 | 7.3% |
| 7 | 13 | 7.3% |
| 8 | 12 | 6.7% |
| 2 | 9 | 5.0% |
| 3 | 6 | 3.4% |
| 1 | 4 | 2.2% |
| a | 3 | 1.7% |
| 4 | 2 | 1.1% |
| Other values (12) | 15 | 8.4% |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 36.4% |
| Missing | 604709 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.090909091 |
| Min length | 5 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 18.2% |
Sample
| 1st row | 220.0 |
|---|---|
| 2nd row | 220.0 |
| 3rd row | 370.0 |
| 4th row | 220.0 |
| 5th row | 1400.0 |
| Value | Count | Frequency (%) |
| 220.0 | 6 | |
| 370.0 | 3 | |
| 1400.0 | 1 | 9.1% |
| 500.0 | 1 | 9.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 24 | |
| 2 | 12 | |
| . | 11 | |
| 3 | 3 | 5.4% |
| 7 | 3 | 5.4% |
| 1 | 1 | 1.8% |
| 4 | 1 | 1.8% |
| 5 | 1 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 45 | |
| Other Punctuation | 11 | 19.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 24 | |
| 2 | 12 | |
| 3 | 3 | 6.7% |
| 7 | 3 | 6.7% |
| 1 | 1 | 2.2% |
| 4 | 1 | 2.2% |
| 5 | 1 | 2.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 56 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 24 | |
| 2 | 12 | |
| . | 11 | |
| 3 | 3 | 5.4% |
| 7 | 3 | 5.4% |
| 1 | 1 | 1.8% |
| 4 | 1 | 1.8% |
| 5 | 1 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 24 | |
| 2 | 12 | |
| . | 11 | |
| 3 | 3 | 5.4% |
| 7 | 3 | 5.4% |
| 1 | 1 | 1.8% |
| 4 | 1 | 1.8% |
| 5 | 1 | 1.8% |
verbatimDepth
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 16.7% |
| Missing | 604714 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 25 |
| Mean length | 25 |
| Min length | 25 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 220m inside cave entrance |
|---|---|
| 2nd row | 220m inside cave entrance |
| 3rd row | 220m inside cave entrance |
| 4th row | 220m inside cave entrance |
| 5th row | 220m inside cave entrance |
| Value | Count | Frequency (%) |
| 220m | 6 | |
| inside | 6 | |
| cave | 6 | |
| entrance | 6 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 24 | |
| (space) | 18 | |
| n | 18 | |
| 2 | 12 | |
| i | 12 | |
| c | 12 | |
| a | 12 | |
| 0 | 6 | 4.0% |
| m | 6 | 4.0% |
| s | 6 | 4.0% |
| Other values (4) | 24 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 114 | |
| Space Separator | 18 | 12.0% |
| Decimal Number | 18 | 12.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 24 | |
| n | 18 | |
| i | 12 | |
| c | 12 | |
| a | 12 | |
| m | 6 | 5.3% |
| s | 6 | 5.3% |
| d | 6 | 5.3% |
| v | 6 | 5.3% |
| t | 6 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 12 | |
| 0 | 6 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 114 | |
| Common | 36 | 24.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 24 | |
| n | 18 | |
| i | 12 | |
| c | 12 | |
| a | 12 | |
| m | 6 | 5.3% |
| s | 6 | 5.3% |
| d | 6 | 5.3% |
| v | 6 | 5.3% |
| t | 6 | 5.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 18 | |
| 2 | 12 | |
| 0 | 6 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 150 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 24 | |
| (space) | 18 | |
| n | 18 | |
| 2 | 12 | |
| i | 12 | |
| c | 12 | |
| a | 12 | |
| 0 | 6 | 4.0% |
| m | 6 | 4.0% |
| s | 6 | 4.0% |
| Other values (4) | 24 |
locationRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Garrison, Rosser W. |
|---|
| Value | Count | Frequency (%) |
| garrison | 1 | |
| rosser | 1 | |
| w | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 3 | |
| s | 3 | |
| o | 2 | |
| (space) | 2 | |
| G | 1 | 5.3% |
| a | 1 | 5.3% |
| i | 1 | 5.3% |
| n | 1 | 5.3% |
| , | 1 | 5.3% |
| R | 1 | 5.3% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Uppercase Letter | 3 | 15.8% |
| Space Separator | 2 | 10.5% |
| Other Punctuation | 2 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 3 | |
| s | 3 | |
| o | 2 | |
| a | 1 | 8.3% |
| i | 1 | 8.3% |
| n | 1 | 8.3% |
| e | 1 | 8.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| R | 1 | |
| W | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 | |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15 | |
| Common | 4 | 21.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 3 | |
| s | 3 | |
| o | 2 | |
| G | 1 | 6.7% |
| a | 1 | 6.7% |
| i | 1 | 6.7% |
| n | 1 | 6.7% |
| R | 1 | 6.7% |
| e | 1 | 6.7% |
| W | 1 | 6.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 2 | |
| , | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 3 | |
| s | 3 | |
| o | 2 | |
| (space) | 2 | |
| G | 1 | 5.3% |
| a | 1 | 5.3% |
| i | 1 | 5.3% |
| n | 1 | 5.3% |
| , | 1 | 5.3% |
| R | 1 | 5.3% |
| Other values (3) | 3 |
decimalLatitude
Text
Missing 
| Distinct | 38000 |
|---|---|
| Distinct (%) | 11.9% |
| Missing | 285696 |
| Missing (%) | 47.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 65 |
|---|---|
| Median length | 7 |
| Mean length | 6.690020187 |
| Min length | 3 |
Unique
| Unique | 15792 |
|---|---|
| Unique (%) | 5.0% |
Sample
| 1st row | 9.91378 |
|---|---|
| 2nd row | 57.18 |
| 3rd row | 36.5787 |
| 4th row | 15.5864 |
| 5th row | 45.4838 |
| Value | Count | Frequency (%) |
| 39.6891 | 5053 | 1.6% |
| 60.75 | 3840 | 1.2% |
| 60.7493 | 2462 | 0.8% |
| 40.0925 | 2379 | 0.7% |
| 38.02 | 2014 | 0.6% |
| 42.7299 | 1697 | 0.5% |
| 37.23 | 1343 | 0.4% |
| 40.015 | 1287 | 0.4% |
| 42.78 | 1170 | 0.4% |
| 38.9559 | 1141 | 0.4% |
| Other values (37318) | 296643 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 319023 | |
| 3 | 273800 | |
| 4 | 209113 | |
| 1 | 188958 | |
| 2 | 172350 | |
| 9 | 169602 | |
| 7 | 165610 | |
| 8 | 159004 | |
| 5 | 153218 | |
| 6 | 152394 | |
| Other values (26) | 171205 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1774064 | |
| Other Punctuation | 319028 | 14.9% |
| Dash Punctuation | 41124 | 1.9% |
| Lowercase Letter | 49 | < 0.1% |
| Uppercase Letter | 7 | < 0.1% |
| Space Separator | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9 | |
| o | 6 | |
| n | 5 | |
| i | 4 | |
| e | 4 | |
| r | 4 | |
| t | 4 | |
| d | 3 | 6.1% |
| g | 2 | 4.1% |
| p | 2 | 4.1% |
| Other values (6) | 6 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 273800 | |
| 4 | 209113 | |
| 1 | 188958 | |
| 2 | 172350 | |
| 9 | 169602 | |
| 7 | 165610 | |
| 8 | 159004 | |
| 5 | 153218 | |
| 6 | 152394 | |
| 0 | 130015 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| Z | 1 | |
| O | 1 | |
| I | 1 | |
| E | 1 | |
| C | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 319023 | |
| , | 5 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 41124 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2134221 | |
| Latin | 56 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9 | |
| o | 6 | |
| n | 5 | 8.9% |
| i | 4 | 7.1% |
| e | 4 | 7.1% |
| r | 4 | 7.1% |
| t | 4 | 7.1% |
| d | 3 | 5.4% |
| A | 2 | 3.6% |
| g | 2 | 3.6% |
| Other values (12) | 13 |
Common
| Value | Count | Frequency (%) |
| . | 319023 | |
| 3 | 273800 | |
| 4 | 209113 | |
| 1 | 188958 | |
| 2 | 172350 | |
| 9 | 169602 | |
| 7 | 165610 | |
| 8 | 159004 | |
| 5 | 153218 | |
| 6 | 152394 | |
| Other values (4) | 171149 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2134277 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 319023 | |
| 3 | 273800 | |
| 4 | 209113 | |
| 1 | 188958 | |
| 2 | 172350 | |
| 9 | 169602 | |
| 7 | 165610 | |
| 8 | 159004 | |
| 5 | 153218 | |
| 6 | 152394 | |
| Other values (26) | 171205 |
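
The category counts above show a small number of letters, commas, and spaces in a column that should hold only signed decimal numbers, and the maximum length of 65 suggests a few rows contain stray text. A minimal sketch of a validity check follows, assuming latitude is expected as a plain decimal in the range -90 to 90. The function name `parse_decimal_latitude` is a hypothetical helper, not something provided by the dataset or the profiler.

```python
from typing import Optional


def parse_decimal_latitude(text: str) -> Optional[float]:
    """Hypothetical helper: return the latitude as a float, or None if invalid."""
    try:
        value = float(text)
    except ValueError:
        return None  # non-numeric text such as a taxon name
    # The chained comparison is False for NaN, so NaN is rejected here too.
    if not -90.0 <= value <= 90.0:
        return None
    return value
```

Rows for which the helper returns `None` are candidates for inspection rather than automatic deletion.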
decimalLongitude
Text
Missing 
| Distinct | 36959 |
|---|---|
| Distinct (%) | 11.6% |
| Missing | 285696 |
| Missing (%) | 47.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.477506395 |
| Min length | 3 |
Unique
| Unique | 15086 |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | -83.6744 |
|---|---|
| 2nd row | -170.27 |
| 3rd row | -75.8881 |
| 4th row | -61.4739 |
| 5th row | -75.9727 |
| Value | Count | Frequency (%) |
| 105.644 | 5103 | 1.6% |
| 139.5 | 3838 | 1.2% |
| 139.504 | 2462 | 0.8% |
| 105.358 | 2379 | 0.7% |
| 87.8123 | 1697 | 0.5% |
| 119.93 | 1404 | 0.4% |
| 105.27 | 1361 | 0.4% |
| 80.4178 | 1322 | 0.4% |
| 0.365 | 1301 | 0.4% |
| 87.76 | 1163 | 0.4% |
| Other values (36449) | 296994 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 319023 | |
| 1 | 292933 | |
| - | 270766 | |
| 7 | 217532 | |
| 8 | 193895 | |
| 6 | 165418 | |
| 5 | 162723 | |
| 3 | 158480 | |
| 2 | 156818 | |
| 9 | 154493 | |
| Other values (8) | 293423 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1795707 | |
| Other Punctuation | 319023 | 13.4% |
| Dash Punctuation | 270766 | 11.4% |
| Lowercase Letter | 7 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 292933 | |
| 7 | 217532 | |
| 8 | 193895 | |
| 6 | 165418 | |
| 5 | 162723 | |
| 3 | 158480 | |
| 2 | 156818 | |
| 9 | 154493 | |
| 4 | 148399 | |
| 0 | 145016 |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 319023 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 270766 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2385496 | |
| Latin | 8 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 319023 | |
| 1 | 292933 | |
| - | 270766 | |
| 7 | 217532 | |
| 8 | 193895 | |
| 6 | 165418 | |
| 5 | 162723 | |
| 3 | 158480 | |
| 2 | 156818 | |
| 9 | 154493 | |
| Other values (2) | 293415 |
Latin
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| A | 1 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2385504 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 319023 | |
| 1 | 292933 | |
| - | 270766 | |
| 7 | 217532 | |
| 8 | 193895 | |
| 6 | 165418 | |
| 5 | 162723 | |
| 3 | 158480 | |
| 2 | 156818 | |
| 9 | 154493 | |
| Other values (8) | 293423 |
geodeticDatum
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 578337 |
| Missing (%) | 95.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 17.50456734 |
| Min length | 5 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | WGS 84 (EPSG:4326) |
|---|---|
| 2nd row | WGS 84 (EPSG:4326) |
| 3rd row | WGS 84 (EPSG:4326) |
| 4th row | WGS 84 (EPSG:4326) |
| 5th row | WGS 84 (EPSG:4326) |
| Value | Count | Frequency (%) |
| wgs | 25014 | |
| 84 | 25014 | |
| epsg:4326 | 25008 | |
| wgs84 | 754 | 1.0% |
| nad83 | 399 | 0.5% |
| epsg:4269 | 399 | 0.5% |
| wgs40 | 214 | 0.3% |
| arthropoda | 1 | < 0.1% |
| 1973-05-08 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 51389 | |
| S | 51389 | |
| 4 | 51389 | |
| (space) | 50421 | |
| 8 | 26168 | 5.7% |
| W | 25982 | 5.6% |
| 3 | 25408 | 5.5% |
| ( | 25407 | 5.5% |
| E | 25407 | 5.5% |
| P | 25407 | 5.5% |
| Other values (20) | 103456 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 180772 | |
| Decimal Number | 154398 | |
| Space Separator | 50421 | 10.9% |
| Open Punctuation | 25407 | 5.5% |
| Other Punctuation | 25407 | 5.5% |
| Close Punctuation | 25407 | 5.5% |
| Lowercase Letter | 9 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 51389 | |
| 8 | 26168 | |
| 3 | 25408 | |
| 2 | 25407 | |
| 6 | 25407 | |
| 9 | 400 | 0.3% |
| 0 | 216 | 0.1% |
| 1 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 51389 | |
| S | 51389 | |
| W | 25982 | |
| E | 25407 | |
| P | 25407 | |
| A | 400 | 0.2% |
| D | 399 | 0.2% |
| N | 399 | 0.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2 | |
| o | 2 | |
| t | 1 | |
| h | 1 | |
| p | 1 | |
| d | 1 | |
| a | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 50421 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 25407 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 25407 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 25407 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 281042 | |
| Latin | 180781 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 51389 | |
| S | 51389 | |
| W | 25982 | |
| E | 25407 | |
| P | 25407 | |
| A | 400 | 0.2% |
| D | 399 | 0.2% |
| N | 399 | 0.2% |
| r | 2 | < 0.1% |
| o | 2 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 4 | 51389 | |
| (space) | 50421 | |
| 8 | 26168 | |
| 3 | 25408 | |
| ( | 25407 | |
| : | 25407 | |
| 2 | 25407 | |
| 6 | 25407 | |
| ) | 25407 | |
| 9 | 400 | 0.1% |
| Other values (5) | 221 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 461823 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 51389 | |
| S | 51389 | |
| 4 | 51389 | |
| (space) | 50421 | |
| 8 | 26168 | 5.7% |
| W | 25982 | 5.6% |
| 3 | 25408 | 5.5% |
| ( | 25407 | 5.5% |
| E | 25407 | 5.5% |
| P | 25407 | 5.5% |
| Other values (20) | 103456 |
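
The word table above shows the same datum written several ways (`WGS 84 (EPSG:4326)`, `wgs84`, `nad83` with `epsg:4269`) alongside stray values such as `arthropoda` and `1973-05-08`. A minimal sketch of mapping these spellings to an EPSG code follows. The function name `datum_to_epsg` and the alias table are assumptions made for illustration; only the two well-known codes (EPSG:4326 for WGS 84, EPSG:4269 for NAD83) are included, and unrecognized values such as `wgs40` are left as `None`.

```python
import re
from typing import Optional

# An explicit "EPSG:<code>" anywhere in the string takes priority.
_EPSG = re.compile(r"EPSG:(\d+)", re.IGNORECASE)

# Assumed alias table, keyed by the value with punctuation and spaces removed.
_ALIASES = {"WGS84": 4326, "NAD83": 4269}


def datum_to_epsg(text: str) -> Optional[int]:
    """Hypothetical helper: map a geodeticDatum string to an EPSG code."""
    match = _EPSG.search(text)
    if match is not None:
        return int(match.group(1))
    key = re.sub(r"[^A-Za-z0-9]", "", text).upper()
    return _ALIASES.get(key)  # None for anything not recognized
```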
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 1494 |
|---|---|
| Distinct (%) | 12.5% |
| Missing | 592766 |
| Missing (%) | 98.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.138698344 |
| Min length | 2 |
Unique
| Unique | 746 |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | 931 |
|---|---|
| 2nd row | 10206 |
| 3rd row | 6642 |
| 4th row | 3036 |
| 5th row | 301 |
| Value | Count | Frequency (%) |
| 3036 | 1744 | 14.6% |
| 301 | 466 | 3.9% |
| 34239 | 426 | 3.6% |
| 1189 | 258 | 2.2% |
| 20000 | 247 | 2.1% |
| 3048 | 220 | 1.8% |
| 15000 | 199 | 1.7% |
| 52150 | 194 | 1.6% |
| 14563 | 162 | 1.4% |
| 9346 | 135 | 1.1% |
| Other values (1484) | 7903 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9238 | |
| 3 | 8252 | |
| 1 | 6353 | |
| 2 | 4894 | |
| 6 | 4647 | |
| 4 | 3910 | |
| 5 | 3501 | 7.1% |
| 9 | 3065 | 6.2% |
| 8 | 2862 | 5.8% |
| 7 | 2745 | 5.5% |
| Other values (7) | 7 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 49467 | |
| Lowercase Letter | 6 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9238 | |
| 3 | 8252 | |
| 1 | 6353 | |
| 2 | 4894 | |
| 6 | 4647 | |
| 4 | 3910 | |
| 5 | 3501 | 7.1% |
| 9 | 3065 | 6.2% |
| 8 | 2862 | 5.8% |
| 7 | 2745 | 5.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1 | |
| s | 1 | |
| e | 1 | |
| c | 1 | |
| t | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49467 | |
| Latin | 7 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9238 | |
| 3 | 8252 | |
| 1 | 6353 | |
| 2 | 4894 | |
| 6 | 4647 | |
| 4 | 3910 | |
| 5 | 3501 | 7.1% |
| 9 | 3065 | 6.2% |
| 8 | 2862 | 5.8% |
| 7 | 2745 | 5.5% |
Latin
| Value | Count | Frequency (%) |
| I | 1 | |
| n | 1 | |
| s | 1 | |
| e | 1 | |
| c | 1 | |
| t | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49474 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9238 | |
| 3 | 8252 | |
| 1 | 6353 | |
| 2 | 4894 | |
| 6 | 4647 | |
| 4 | 3910 | |
| 5 | 3501 | 7.1% |
| 9 | 3065 | 6.2% |
| 8 | 2862 | 5.8% |
| 7 | 2745 | 5.5% |
| Other values (7) | 7 | < 0.1% |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604717 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 4 |
| Min length | 2 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Odonata |
|---|---|
| 2nd row | 69 |
| 3rd row | 128 |
| Value | Count | Frequency (%) |
| odonata | 1 | |
| 69 | 1 | |
| 128 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| O | 1 | |
| d | 1 | |
| o | 1 | |
| n | 1 | |
| t | 1 | |
| 6 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 2 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Decimal Number | 5 | |
| Uppercase Letter | 1 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| d | 1 | |
| o | 1 | |
| n | 1 | |
| t | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 | |
| Common | 5 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| O | 1 | |
| d | 1 | |
| o | 1 | |
| n | 1 | |
| t | 1 |
Common
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| O | 1 | |
| d | 1 | |
| o | 1 | |
| n | 1 | |
| t | 1 | |
| 6 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 2 | 1 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604718 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2.5 |
| Mean length | 2.5 |
| Min length | 2 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 69 |
|---|---|
| 2nd row | 128 |
| Value | Count | Frequency (%) |
| 69 | 1 | |
| 128 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 9 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 8 | 1 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604718 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 4 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Coenagrionidae |
|---|---|
| 2nd row | 1973 |
| Value | Count | Frequency (%) |
| coenagrionidae | 1 | |
| 1973 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2 | |
| e | 2 | |
| n | 2 | |
| a | 2 | |
| i | 2 | |
| C | 1 | 5.6% |
| g | 1 | 5.6% |
| r | 1 | 5.6% |
| d | 1 | 5.6% |
| 1 | 1 | 5.6% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13 | |
| Decimal Number | 4 | 22.2% |
| Uppercase Letter | 1 | 5.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2 | |
| e | 2 | |
| n | 2 | |
| a | 2 | |
| i | 2 | |
| g | 1 | |
| r | 1 | |
| d | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 9 | 1 | |
| 7 | 1 | |
| 3 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14 | |
| Common | 4 | 22.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2 | |
| e | 2 | |
| n | 2 | |
| a | 2 | |
| i | 2 | |
| C | 1 | |
| g | 1 | |
| r | 1 | |
| d | 1 |
Common
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 9 | 1 | |
| 7 | 1 | |
| 3 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2 | |
| e | 2 | |
| n | 2 | |
| a | 2 | |
| i | 2 | |
| C | 1 | 5.6% |
| g | 1 | 5.6% |
| r | 1 | 5.6% |
| d | 1 | 5.6% |
| 1 | 1 | 5.6% |
| Other values (3) | 3 |
verbatimLatitude
Text
Missing 
| Distinct | 10290 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 523062 |
| Missing (%) | 86.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 9 |
| Mean length | 8.943949154 |
| Min length | 1 |
Unique
| Unique | 3874 |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | N36.578717 |
|---|---|
| 2nd row | 0 deg 50' 00" N |
| 3rd row | 3 deg. 21.1' N |
| 4th row | 10 32' S |
| 5th row | 39.079276 |
| Value | Count | Frequency (%) |
| n | 12202 | 10.3% |
| deg | 3779 | 3.2% |
| s | 3061 | 2.6% |
| 40.014986 | 1227 | 1.0% |
| 38.955944 | 1139 | 1.0% |
| 39 | 889 | 0.7% |
| 10 | 854 | 0.7% |
| 12 | 805 | 0.7% |
| 40.001652 | 790 | 0.7% |
| 38 | 783 | 0.7% |
| Other values (9199) | 93254 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 79859 | |
| . | 76634 | |
| 4 | 74529 | |
| 1 | 56463 | 7.7% |
| 2 | 53355 | 7.3% |
| 8 | 51998 | 7.1% |
| 0 | 49153 | 6.7% |
| 9 | 48559 | 6.6% |
| 5 | 48310 | 6.6% |
| 6 | 41823 | 5.7% |
| Other values (44) | 149662 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 544502 | |
| Other Punctuation | 90638 | 12.4% |
| Space Separator | 37125 | 5.1% |
| Uppercase Letter | 29498 | 4.0% |
| Lowercase Letter | 18739 | 2.6% |
| Other Symbol | 5504 | 0.8% |
| Dash Punctuation | 4170 | 0.6% |
| Open Punctuation | 49 | < 0.1% |
| Close Punctuation | 49 | < 0.1% |
| Other Letter | 32 | < 0.1% |
| Other values (4) | 39 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 6147 | |
| g | 5893 | |
| e | 5794 | |
| r | 659 | 3.5% |
| s | 198 | 1.1% |
| t | 11 | 0.1% |
| n | 10 | 0.1% |
| o | 10 | 0.1% |
| h | 5 | < 0.1% |
| l | 3 | < 0.1% |
| Other values (5) | 9 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 79859 | |
| 4 | 74529 | |
| 1 | 56463 | |
| 2 | 53355 | |
| 8 | 51998 | |
| 0 | 49153 | |
| 9 | 48559 | |
| 5 | 48310 | |
| 6 | 41823 | |
| 7 | 40453 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 76634 | |
| ' | 13062 | 14.4% |
| " | 813 | 0.9% |
| : | 79 | 0.1% |
| ′ | 16 | < 0.1% |
| ″ | 15 | < 0.1% |
| & | 13 | < 0.1% |
| , | 4 | < 0.1% |
| \ | 2 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 22689 | |
| S | 6676 | 22.6% |
| W | 81 | 0.3% |
| E | 30 | 0.1% |
| B | 16 | 0.1% |
| D | 5 | < 0.1% |
| M | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 48 | |
| ( | 1 | 2.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 48 | |
| ) | 1 | 2.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ˚ | 16 | |
| ´ | 9 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 37125 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 5504 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4170 |
Other Letter
| Value | Count | Frequency (%) |
| º | 32 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 8 |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 4 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̊ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 682074 | |
| Latin | 48269 | 6.6% |
| Inherited | 2 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 79859 | |
| . | 76634 | |
| 4 | 74529 | |
| 1 | 56463 | |
| 2 | 53355 | |
| 8 | 51998 | |
| 0 | 49153 | |
| 9 | 48559 | |
| 5 | 48310 | |
| 6 | 41823 | |
| Other values (20) | 101391 |
Latin
| Value | Count | Frequency (%) |
| N | 22689 | |
| S | 6676 | 13.8% |
| d | 6147 | 12.7% |
| g | 5893 | 12.2% |
| e | 5794 | 12.0% |
| r | 659 | 1.4% |
| s | 198 | 0.4% |
| W | 81 | 0.2% |
| º | 32 | 0.1% |
| E | 30 | 0.1% |
| Other values (13) | 70 | 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ̊ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 724743 | |
| None | 5545 | 0.8% |
| Punctuation | 39 | < 0.1% |
| Modifier Letters | 16 | < 0.1% |
| Diacriticals | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 79859 | |
| . | 76634 | |
| 4 | 74529 | |
| 1 | 56463 | 7.8% |
| 2 | 53355 | 7.4% |
| 8 | 51998 | 7.2% |
| 0 | 49153 | 6.8% |
| 9 | 48559 | 6.7% |
| 5 | 48310 | 6.7% |
| 6 | 41823 | 5.8% |
| Other values (36) | 144060 |
None
| Value | Count | Frequency (%) |
| ° | 5504 | |
| º | 32 | 0.6% |
| ´ | 9 | 0.2% |
Punctuation
| Value | Count | Frequency (%) |
| ′ | 16 | |
| ″ | 15 | |
| ” | 8 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˚ | 16 |
Diacriticals
| Value | Count | Frequency (%) |
| ̊ | 2 |
Missing 
| Distinct | 10183 |
|---|---|
| Distinct (%) | 12.5% |
| Missing | 523032 |
| Missing (%) | 86.5% |
| Memory size | 4.6 MiB |
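The percentages in this summary are consistent with Missing (%) being taken over all 604720 observations, while Distinct (%) and Unique (%) are taken over the non-missing values only. The following is a small arithmetic check under that assumption. The denominators are inferred from the reported numbers, not stated in the report, and the variable names are illustrative.

```python
# Counts copied from the report for this column.
n_rows = 604720      # Number of observations (dataset overview)
n_missing = 523032   # Missing
n_distinct = 10183   # Distinct
n_unique = 3804      # Unique (values that occur exactly once)

# Assumption: distinct and unique shares are relative to non-missing values.
n_present = n_rows - n_missing

missing_pct = 100 * n_missing / n_rows
distinct_pct = 100 * n_distinct / n_present
unique_pct = 100 * n_unique / n_present

print(f"present={n_present}")
print(f"missing={missing_pct:.1f}% distinct={distinct_pct:.1f}% unique={unique_pct:.1f}%")
```

Rounded to one decimal place, the three values agree with the 86.5%, 12.5% and 4.7% shown for this column.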
Length
| Max length | 31 |
|---|---|
| Median length | 28 |
| Mean length | 9.817243659 |
| Min length | 1 |
Unique
| Unique | 3804 |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | W75.88805 |
|---|---|
| 2nd row | 66 deg 09' 44" W |
| 3rd row | 59 deg. 40.5' W |
| 4th row | 62 48' W |
| 5th row | -76.59802 |
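The sample rows mix several notations: a hemisphere-prefixed decimal, degrees-minutes-seconds, degrees with decimal minutes, and a signed decimal. The following is a minimal sketch of how such strings could be normalized to signed decimal degrees. The function `parse_longitude` is hypothetical and only handles the patterns visible in the five sample rows. It is not part of the dataset or of the profiling tool.

```python
import re

def parse_longitude(text):
    """Return signed decimal degrees for a verbatim longitude string, or None.

    Assumption: the string holds one to three numbers read as degrees,
    minutes, seconds, and a 'W' or a leading '-' marks the western hemisphere.
    """
    numbers = re.findall(r"\d+(?:\.\d+)?", text)
    if not numbers or len(numbers) > 3:
        return None  # nothing usable, or more parts than D/M/S
    # Pad missing minutes and seconds with zero.
    deg, minutes, seconds = (list(map(float, numbers)) + [0.0, 0.0])[:3]
    value = deg + minutes / 60 + seconds / 3600
    west = "W" in text.upper() or text.strip().startswith("-")
    return -value if west else value

samples = ["W75.88805", "66 deg 09' 44\" W", "59 deg. 40.5' W", "62 48' W", "-76.59802"]
for s in samples:
    print(repr(s), "->", parse_longitude(s))
```

A real cleaning step would also need to handle the rarer symbols listed in the character tables (for example `°`, `º`, `′`, `″`) and range values in brackets, which this sketch passes over or rejects.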
| Value | Count | Frequency (%) |
| w | 13038 | 11.0% |
| deg | 3758 | 3.2% |
| e | 2358 | 2.0% |
| 105.270546 | 1260 | 1.1% |
| 76.94553 | 1139 | 1.0% |
| 76 | 1012 | 0.9% |
| 59 | 834 | 0.7% |
| 105.307491 | 790 | 0.7% |
| 70 | 782 | 0.7% |
| 77.254426 | 778 | 0.7% |
| Other values (9264) | 92725 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 78854 | 9.8% |
| . | 76662 | 9.6% |
| 1 | 65451 | 8.2% |
| 8 | 61925 | 7.7% |
| 0 | 59445 | 7.4% |
| 5 | 56396 | 7.0% |
| 6 | 55542 | 6.9% |
| - | 52771 | 6.6% |
| 2 | 48852 | 6.1% |
| 3 | 48668 | 6.1% |
| Other values (44) | 197385 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 568041 | |
| Other Punctuation | 90510 | 11.3% |
| Dash Punctuation | 52771 | 6.6% |
| Space Separator | 36786 | 4.6% |
| Uppercase Letter | 29452 | 3.7% |
| Lowercase Letter | 18732 | 2.3% |
| Other Symbol | 5488 | 0.7% |
| Close Punctuation | 51 | < 0.1% |
| Open Punctuation | 49 | < 0.1% |
| Other Letter | 32 | < 0.1% |
| Other values (4) | 39 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 6143 | |
| g | 5886 | |
| e | 5802 | |
| r | 655 | 3.5% |
| s | 196 | 1.0% |
| w | 23 | 0.1% |
| t | 9 | < 0.1% |
| o | 6 | < 0.1% |
| n | 4 | < 0.1% |
| l | 3 | < 0.1% |
| Other values (3) | 5 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 78854 | |
| 1 | 65451 | |
| 8 | 61925 | |
| 0 | 59445 | |
| 5 | 56396 | |
| 6 | 55542 | |
| 2 | 48852 | |
| 3 | 48668 | |
| 4 | 47518 | |
| 9 | 45390 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 76662 | |
| ' | 12870 | 14.2% |
| " | 843 | 0.9% |
| : | 79 | 0.1% |
| ′ | 16 | < 0.1% |
| ″ | 15 | < 0.1% |
| & | 13 | < 0.1% |
| , | 8 | < 0.1% |
| ; | 3 | < 0.1% |
| ? | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 24235 | |
| E | 5087 | 17.3% |
| N | 54 | 0.2% |
| S | 53 | 0.2% |
| L | 16 | 0.1% |
| O | 5 | < 0.1% |
| D | 1 | < 0.1% |
| M | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 49 | |
| ) | 2 | 3.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 47 | |
| ( | 2 | 4.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ˚ | 16 | |
| ´ | 9 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 52771 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 36786 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 5488 |
Other Letter
| Value | Count | Frequency (%) |
| º | 32 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 8 |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 4 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̊ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 753733 | |
| Latin | 48216 | 6.0% |
| Inherited | 2 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 78854 | |
| . | 76662 | |
| 1 | 65451 | |
| 8 | 61925 | |
| 0 | 59445 | 7.9% |
| 5 | 56396 | 7.5% |
| 6 | 55542 | 7.4% |
| - | 52771 | 7.0% |
| 2 | 48852 | 6.5% |
| 3 | 48668 | 6.5% |
| Other values (21) | 149167 |
Latin
| Value | Count | Frequency (%) |
| W | 24235 | |
| d | 6143 | 12.7% |
| g | 5886 | 12.2% |
| e | 5802 | 12.0% |
| E | 5087 | 10.6% |
| r | 655 | 1.4% |
| s | 196 | 0.4% |
| N | 54 | 0.1% |
| S | 53 | 0.1% |
| º | 32 | 0.1% |
| Other values (12) | 73 | 0.2% |
Inherited
| Value | Count | Frequency (%) |
| ̊ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 796365 | |
| None | 5529 | 0.7% |
| Punctuation | 39 | < 0.1% |
| Modifier Letters | 16 | < 0.1% |
| Diacriticals | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 78854 | |
| . | 76662 | 9.6% |
| 1 | 65451 | 8.2% |
| 8 | 61925 | 7.8% |
| 0 | 59445 | 7.5% |
| 5 | 56396 | 7.1% |
| 6 | 55542 | 7.0% |
| - | 52771 | 6.6% |
| 2 | 48852 | 6.1% |
| 3 | 48668 | 6.1% |
| Other values (36) | 191799 |
None
| Value | Count | Frequency (%) |
| ° | 5488 | |
| º | 32 | 0.6% |
| ´ | 9 | 0.2% |
Punctuation
| Value | Count | Frequency (%) |
| ′ | 16 | |
| ″ | 15 | |
| ” | 8 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˚ | 16 |
Diacriticals
| Value | Count | Frequency (%) |
| ̊ | 2 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604717 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 8 |
| Mean length | 12.66666667 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | 9 March |
| 3rd row | 8.v.1973 |
| Value | Count | Frequency (%) |
| degrees | 1 | |
| minutes | 1 | |
| seconds | 1 | |
| 9 | 1 | |
| march | 1 | |
| 8.v.1973 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5 | 13.2% |
| s | 3 | 7.9% |
| (space) | 3 | 7.9% |
| c | 2 | 5.3% |
| 9 | 2 | 5.3% |
| r | 2 | 5.3% |
| M | 2 | 5.3% |
| n | 2 | 5.3% |
| . | 2 | 5.3% |
| 7 | 1 | 2.6% |
| Other values (14) | 14 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23 | |
| Decimal Number | 6 | 15.8% |
| Uppercase Letter | 4 | 10.5% |
| Space Separator | 3 | 7.9% |
| Other Punctuation | 2 | 5.3% |
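The category labels in this table match the names of Unicode general categories. The following is a minimal sketch, assuming the counts come from classifying every character of every non-missing value. The helper `category_counts` and the `CATEGORY_NAMES` mapping are illustrative and are not the profiling tool's own code.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in these three values.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# The three sample rows shown for this column.
samples = ["Degrees Minutes Seconds", "9 March", "8.v.1973"]
counts = category_counts(samples)
total = sum(counts.values())
for name, n in counts.most_common():
    print(f"{name}: {n} ({100 * n / total:.1f}%)")
```

Run on the three sample rows, this gives the same per-category counts as the table (23, 6, 4, 3 and 2 out of 38 characters).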
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| c | 2 | 8.7% |
| r | 2 | 8.7% |
| n | 2 | 8.7% |
| v | 1 | 4.3% |
| h | 1 | 4.3% |
| a | 1 | 4.3% |
| d | 1 | 4.3% |
| o | 1 | 4.3% |
| Other values (4) | 4 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 7 | 1 | |
| 1 | 1 | |
| 8 | 1 | |
| 3 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2 | |
| D | 1 | |
| S | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27 | |
| Common | 11 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| c | 2 | 7.4% |
| r | 2 | 7.4% |
| M | 2 | 7.4% |
| n | 2 | 7.4% |
| v | 1 | 3.7% |
| h | 1 | 3.7% |
| a | 1 | 3.7% |
| D | 1 | 3.7% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| (space) | 3 | |
| 9 | 2 | |
| . | 2 | |
| 7 | 1 | 9.1% |
| 1 | 1 | 9.1% |
| 8 | 1 | 9.1% |
| 3 | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5 | 13.2% |
| s | 3 | 7.9% |
| (space) | 3 | 7.9% |
| c | 2 | 5.3% |
| 9 | 2 | 5.3% |
| r | 2 | 5.3% |
| M | 2 | 5.3% |
| n | 2 | 5.3% |
| . | 2 | 5.3% |
| 7 | 1 | 2.6% |
| Other values (14) | 14 |
verbatimSRS
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Argia |
|---|---|
| Value | Count | Frequency (%) |
| argia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1 | |
| r | 1 | |
| g | 1 | |
| i | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4 | |
| Uppercase Letter | 1 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 1 | |
| g | 1 | |
| i | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1 | |
| r | 1 | |
| g | 1 | |
| i | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1 | |
| r | 1 | |
| g | 1 | |
| i | 1 | |
| a | 1 |
footprintSpatialFit
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Gynacantha membranalis |
|---|---|
| Value | Count | Frequency (%) |
| gynacantha | 1 | |
| membranalis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5 | |
| n | 3 | |
| m | 2 | 9.1% |
| G | 1 | 4.5% |
| y | 1 | 4.5% |
| c | 1 | 4.5% |
| t | 1 | 4.5% |
| h | 1 | 4.5% |
| (space) | 1 | 4.5% |
| e | 1 | 4.5% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20 | |
| Uppercase Letter | 1 | 4.5% |
| Space Separator | 1 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5 | |
| n | 3 | |
| m | 2 | 10.0% |
| y | 1 | 5.0% |
| c | 1 | 5.0% |
| t | 1 | 5.0% |
| h | 1 | 5.0% |
| e | 1 | 5.0% |
| b | 1 | 5.0% |
| r | 1 | 5.0% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21 | |
| Common | 1 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5 | |
| n | 3 | |
| m | 2 | 9.5% |
| G | 1 | 4.8% |
| y | 1 | 4.8% |
| c | 1 | 4.8% |
| t | 1 | 4.8% |
| h | 1 | 4.8% |
| e | 1 | 4.8% |
| b | 1 | 4.8% |
| Other values (4) | 4 |
Common
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5 | |
| n | 3 | |
| m | 2 | 9.1% |
| G | 1 | 4.5% |
| y | 1 | 4.5% |
| c | 1 | 4.5% |
| t | 1 | 4.5% |
| h | 1 | 4.5% |
| (space) | 1 | 4.5% |
| e | 1 | 4.5% |
| Other values (5) | 5 |
georeferencedBy
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | orichalcea |
|---|---|
| Value | Count | Frequency (%) |
| orichalcea | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 2 | |
| a | 2 | |
| o | 1 | |
| r | 1 | |
| i | 1 | |
| h | 1 | |
| l | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 2 | |
| a | 2 | |
| o | 1 | |
| r | 1 | |
| i | 1 | |
| h | 1 | |
| l | 1 | |
| e | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 2 | |
| a | 2 | |
| o | 1 | |
| r | 1 | |
| i | 1 | |
| h | 1 | |
| l | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 2 | |
| a | 2 | |
| o | 1 | |
| r | 1 | |
| i | 1 | |
| h | 1 | |
| l | 1 | |
| e | 1 |
Missing 
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 366819 |
| Missing (%) | 60.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 72 |
|---|---|
| Median length | 12 |
| Mean length | 10.94749497 |
| Min length | 3 |
Unique
| Unique | 21 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Google Maps |
|---|---|
| 2nd row | Google Earth |
| 3rd row | Google Earth |
| 4th row | GEOLocate |
| 5th row | Google Earth |
| Value | Count | Frequency (%) |
| google | 163403 | |
| earth | 120779 | |
| geolocate | 70758 | |
| maps | 42650 | 10.5% |
| gps | 1516 | 0.4% |
| coordinates | 782 | 0.2% |
| centroid | 781 | 0.2% |
| geonames | 719 | 0.2% |
| from | 711 | 0.2% |
| country | 671 | 0.2% |
| Other values (105) | 2061 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 402623 | |
| e | 238641 | |
| a | 237508 | |
| G | 236572 | |
| t | 194824 | |
| E | 191441 | |
| l | 169506 | 6.5% |
| (space) | 166930 | 6.4% |
| g | 163835 | 6.3% |
| r | 124382 | 4.8% |
| Other values (51) | 478158 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1824030 | |
| Uppercase Letter | 612259 | 23.5% |
| Space Separator | 166930 | 6.4% |
| Decimal Number | 941 | < 0.1% |
| Other Punctuation | 250 | < 0.1% |
| Dash Punctuation | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 402623 | |
| e | 238641 | |
| a | 237508 | |
| t | 194824 | |
| l | 169506 | |
| g | 163835 | |
| r | 124382 | 6.8% |
| h | 120880 | 6.6% |
| c | 72658 | 4.0% |
| s | 44356 | 2.4% |
| Other values (14) | 54817 | 3.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 236572 | |
| E | 191441 | |
| O | 70685 | 11.5% |
| L | 65530 | 10.7% |
| M | 42663 | 7.0% |
| S | 1607 | 0.3% |
| P | 1564 | 0.3% |
| C | 982 | 0.2% |
| N | 745 | 0.1% |
| B | 158 | < 0.1% |
| Other values (8) | 312 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 213 | |
| 1 | 200 | |
| 7 | 175 | |
| 2 | 170 | |
| 0 | 94 | |
| 6 | 48 | 5.1% |
| 8 | 16 | 1.7% |
| 4 | 14 | 1.5% |
| 3 | 9 | 1.0% |
| 5 | 2 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 85 | |
| & | 49 | |
| / | 48 | |
| . | 43 | |
| : | 21 | 8.4% |
| " | 2 | 0.8% |
| ; | 2 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 166930 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2436289 | |
| Common | 168131 | 6.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 402623 | |
| e | 238641 | |
| a | 237508 | |
| G | 236572 | |
| t | 194824 | |
| E | 191441 | |
| l | 169506 | |
| g | 163835 | |
| r | 124382 | 5.1% |
| h | 120880 | 5.0% |
| Other values (32) | 356077 |
Common
| Value | Count | Frequency (%) |
| (space) | 166930 | |
| 9 | 213 | 0.1% |
| 1 | 200 | 0.1% |
| 7 | 175 | 0.1% |
| 2 | 170 | 0.1% |
| 0 | 94 | 0.1% |
| , | 85 | 0.1% |
| & | 49 | < 0.1% |
| / | 48 | < 0.1% |
| 6 | 48 | < 0.1% |
| Other values (9) | 119 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2604420 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 402623 | |
| e | 238641 | |
| a | 237508 | |
| G | 236572 | |
| t | 194824 | |
| E | 191441 | |
| l | 169506 | 6.5% |
| (space) | 166930 | 6.4% |
| g | 163835 | 6.3% |
| r | 124382 | 4.8% |
| Other values (51) | 478158 |
Missing 
| Distinct | 1134 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 596270 |
| Missing (%) | 98.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 200 |
|---|---|
| Median length | 182 |
| Mean length | 45.17183432 |
| Min length | 10 |
Unique
| Unique | 400 |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | Coordinate Uncertainty In Meters: 56182 |
|---|---|
| 2nd row | Coordinate Uncertainty In Meters: 49611 |
| 3rd row | Coordinate Uncertainty In Meters: 97700 |
| 4th row | Coordinate Uncertainty In Meters: 41787 |
| 5th row | Coordinate Uncertainty In Meters: 71236 |
| Value | Count | Frequency (%) |
| in | 8280 | |
| coordinate | 8141 | |
| meters | 8141 | |
| uncertainty | 8141 | |
| verbatim | 1307 | 2.7% |
| coordinate-degrees | 1307 | 2.7% |
| minutes | 1307 | 2.7% |
| 3792 | 274 | 0.6% |
| the | 221 | 0.5% |
| 6066 | 174 | 0.4% |
| Other values (1273) | 10425 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 42275 | 11.1% |
| (space) | 39268 | 10.3% |
| t | 37520 | 9.8% |
| n | 36171 | 9.5% |
| r | 29384 | 7.7% |
| i | 21348 | 5.6% |
| o | 20139 | 5.3% |
| a | 19993 | 5.2% |
| s | 11760 | 3.1% |
| d | 9751 | 2.6% |
| Other values (59) | 114093 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 255005 | |
| Space Separator | 39268 | 10.3% |
| Decimal Number | 38776 | 10.2% |
| Uppercase Letter | 37573 | 9.8% |
| Other Punctuation | 9667 | 2.5% |
| Dash Punctuation | 1342 | 0.4% |
| Open Punctuation | 33 | < 0.1% |
| Close Punctuation | 33 | < 0.1% |
| Initial Punctuation | 2 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 42275 | |
| t | 37520 | |
| n | 36171 | |
| r | 29384 | |
| i | 21348 | |
| o | 20139 | |
| a | 19993 | |
| s | 11760 | 4.6% |
| d | 9751 | 3.8% |
| c | 8647 | 3.4% |
| Other values (16) | 18017 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 9647 | |
| M | 8188 | |
| U | 8175 | |
| I | 8162 | |
| D | 1329 | 3.5% |
| V | 1307 | 3.5% |
| T | 264 | 0.7% |
| N | 88 | 0.2% |
| S | 85 | 0.2% |
| G | 82 | 0.2% |
| Other values (10) | 246 | 0.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4555 | |
| 6 | 4451 | |
| 0 | 4424 | |
| 3 | 4272 | |
| 2 | 4116 | |
| 5 | 3998 | |
| 4 | 3413 | |
| 7 | 3301 | |
| 9 | 3147 | |
| 8 | 3099 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 8141 | |
| ; | 1326 | 13.7% |
| , | 101 | 1.0% |
| . | 90 | 0.9% |
| ' | 5 | 0.1% |
| " | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 39268 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1342 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 33 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 33 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 2 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 292578 | |
| Common | 89124 | 23.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 42275 | |
| t | 37520 | |
| n | 36171 | |
| r | 29384 | |
| i | 21348 | 7.3% |
| o | 20139 | 6.9% |
| a | 19993 | 6.8% |
| s | 11760 | 4.0% |
| d | 9751 | 3.3% |
| C | 9647 | 3.3% |
| Other values (36) | 54590 |
Common
| Value | Count | Frequency (%) |
| (space) | 39268 | |
| : | 8141 | 9.1% |
| 1 | 4555 | 5.1% |
| 6 | 4451 | 5.0% |
| 0 | 4424 | 5.0% |
| 3 | 4272 | 4.8% |
| 2 | 4116 | 4.6% |
| 5 | 3998 | 4.5% |
| 4 | 3413 | 3.8% |
| 7 | 3301 | 3.7% |
| Other values (13) | 9185 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 381693 | |
| None | 5 | < 0.1% |
| Punctuation | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 42275 | 11.1% |
| (space) | 39268 | 10.3% |
| t | 37520 | 9.8% |
| n | 36171 | 9.5% |
| r | 29384 | 7.7% |
| i | 21348 | 5.6% |
| o | 20139 | 5.3% |
| a | 19993 | 5.2% |
| s | 11760 | 3.1% |
| d | 9751 | 2.6% |
| Other values (56) | 114084 |
None
| Value | Count | Frequency (%) |
| ñ | 5 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 2 | |
| ” | 2 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604716 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 17 |
| Mean length | 17.5 |
| Min length | 4 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Hagen in Selys |
|---|---|
| 2nd row | Brazil, [Not Stated] |
| 3rd row | United States, Florida, Pinellas |
| 4th row | Peru |
| Value | Count | Frequency (%) |
| hagen | 1 | |
| in | 1 | |
| selys | 1 | |
| brazil | 1 | |
| not | 1 | |
| stated | 1 | |
| united | 1 | |
| states | 1 | |
| florida | 1 | |
| pinellas | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 7 | 10.0% |
| e | 7 | 10.0% |
| t | 6 | 8.6% |
| a | 6 | 8.6% |
| i | 5 | 7.1% |
| l | 5 | 7.1% |
| n | 4 | 5.7% |
| S | 3 | 4.3% |
| s | 3 | 4.3% |
| , | 3 | 4.3% |
| Other values (15) | 21 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 48 | |
| Uppercase Letter | 10 | 14.3% |
| Space Separator | 7 | 10.0% |
| Other Punctuation | 3 | 4.3% |
| Close Punctuation | 1 | 1.4% |
| Open Punctuation | 1 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7 | |
| t | 6 | |
| a | 6 | |
| i | 5 | |
| l | 5 | |
| n | 4 | |
| s | 3 | |
| d | 3 | |
| r | 3 | |
| o | 2 | 4.2% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3 | |
| P | 2 | |
| F | 1 | 10.0% |
| U | 1 | 10.0% |
| H | 1 | 10.0% |
| N | 1 | 10.0% |
| B | 1 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 58 | |
| Common | 12 | 17.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 7 | |
| t | 6 | |
| a | 6 | |
| i | 5 | 8.6% |
| l | 5 | 8.6% |
| n | 4 | 6.9% |
| S | 3 | 5.2% |
| s | 3 | 5.2% |
| d | 3 | 5.2% |
| r | 3 | 5.2% |
| Other values (11) | 13 |
Common
| Value | Count | Frequency (%) |
| (space) | 7 | |
| , | 3 | |
| ] | 1 | 8.3% |
| [ | 1 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 7 | 10.0% |
| e | 7 | 10.0% |
| t | 6 | 8.6% |
| a | 6 | 8.6% |
| i | 5 | 7.1% |
| l | 5 | 7.1% |
| n | 4 | 5.7% |
| S | 3 | 4.3% |
| s | 3 | 4.3% |
| , | 3 | 4.3% |
| Other values (15) | 21 |
earliestEonOrLowestEonothem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 61 |
|---|---|
| Median length | 61 |
| Mean length | 61 |
| Min length | 61 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Animalia, Arthropoda, Insecta, Odonata, Anisoptera, Aeshnidae |
|---|---|
| Value | Count | Frequency (%) |
| animalia | 1 | |
| arthropoda | 1 | |
| insecta | 1 | |
| odonata | 1 | |
| anisoptera | 1 | |
| aeshnidae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| (space) | 5 | 8.2% |
| n | 5 | 8.2% |
| , | 5 | 8.2% |
| A | 4 | 6.6% |
| e | 4 | 6.6% |
| o | 4 | 6.6% |
| t | 4 | 6.6% |
| i | 4 | 6.6% |
| r | 3 | 4.9% |
| Other values (9) | 15 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 45 | |
| Uppercase Letter | 6 | 9.8% |
| Space Separator | 5 | 8.2% |
| Other Punctuation | 5 | 8.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| n | 5 | |
| e | 4 | |
| o | 4 | |
| t | 4 | |
| i | 4 | |
| r | 3 | 6.7% |
| d | 3 | 6.7% |
| s | 3 | 6.7% |
| h | 2 | 4.4% |
| Other values (4) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| I | 1 | 16.7% |
| O | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51 | |
| Common | 10 | 16.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| n | 5 | |
| A | 4 | |
| e | 4 | |
| o | 4 | |
| t | 4 | |
| i | 4 | |
| r | 3 | 5.9% |
| d | 3 | 5.9% |
| s | 3 | 5.9% |
| Other values (7) | 9 |
Common
| Value | Count | Frequency (%) |
| (space) | 5 | |
| , | 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| (space) | 5 | 8.2% |
| n | 5 | 8.2% |
| , | 5 | 8.2% |
| A | 4 | 6.6% |
| e | 4 | 6.6% |
| o | 4 | 6.6% |
| t | 4 | 6.6% |
| i | 4 | 6.6% |
| r | 3 | 4.9% |
| Other values (9) | 15 |
latestEonOrHighestEonothem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Animalia |
|---|---|
| Value | Count | Frequency (%) |
| animalia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| A | 1 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| A | 1 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2 | |
| a | 2 | |
| A | 1 | |
| n | 1 | |
| m | 1 | |
| l | 1 |
earliestEraOrLowestErathem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Arthropoda |
|---|---|
| Value | Count | Frequency (%) |
| arthropoda | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2 | |
| o | 2 | |
| A | 1 | |
| t | 1 | |
| h | 1 | |
| p | 1 | |
| d | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 | |
| Uppercase Letter | 1 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2 | |
| o | 2 | |
| t | 1 | |
| h | 1 | |
| p | 1 | |
| d | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2 | |
| o | 2 | |
| A | 1 | |
| t | 1 | |
| h | 1 | |
| p | 1 | |
| d | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2 | |
| o | 2 | |
| A | 1 | |
| t | 1 | |
| h | 1 | |
| p | 1 | |
| d | 1 | |
| a | 1 |
latestEraOrHighestErathem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Insecta |
|---|---|
| Value | Count | Frequency (%) |
| insecta | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1 | |
| n | 1 | |
| s | 1 | |
| e | 1 | |
| c | 1 | |
| t | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1 | |
| s | 1 | |
| e | 1 | |
| c | 1 | |
| t | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 1 | |
| n | 1 | |
| s | 1 | |
| e | 1 | |
| c | 1 | |
| t | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 1 | |
| n | 1 | |
| s | 1 | |
| e | 1 | |
| c | 1 | |
| t | 1 | |
| a | 1 |
earliestPeriodOrLowestSystem
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604716 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 6.5 |
| Mean length | 7.5 |
| Min length | 4 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Brazil |
|---|---|
| 2nd row | United States |
| 3rd row | Odonata |
| 4th row | Peru |
| Value | Count | Frequency (%) |
| brazil | 1 | |
| united | 1 | |
| states | 1 | |
| odonata | 1 | |
| peru | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| t | 4 | |
| e | 3 | 10.0% |
| i | 2 | 6.7% |
| n | 2 | 6.7% |
| r | 2 | 6.7% |
| d | 2 | 6.7% |
| S | 1 | 3.3% |
| P | 1 | 3.3% |
| o | 1 | 3.3% |
| Other values (8) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24 | |
| Uppercase Letter | 5 | 16.7% |
| Space Separator | 1 | 3.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| t | 4 | |
| e | 3 | |
| i | 2 | |
| n | 2 | |
| r | 2 | |
| d | 2 | |
| o | 1 | 4.2% |
| s | 1 | 4.2% |
| l | 1 | 4.2% |
| Other values (2) | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| P | 1 | |
| O | 1 | |
| B | 1 | |
| U | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29 | |
| Common | 1 | 3.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| t | 4 | |
| e | 3 | |
| i | 2 | 6.9% |
| n | 2 | 6.9% |
| r | 2 | 6.9% |
| d | 2 | 6.9% |
| S | 1 | 3.4% |
| P | 1 | 3.4% |
| o | 1 | 3.4% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| t | 4 | |
| e | 3 | 10.0% |
| i | 2 | 6.7% |
| n | 2 | 6.7% |
| r | 2 | 6.7% |
| d | 2 | 6.7% |
| S | 1 | 3.3% |
| P | 1 | 3.3% |
| o | 1 | 3.3% |
| Other values (8) | 8 |
earliestEpochOrLowestSeries
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604717 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 9 |
| Mean length | 9.333333333 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | [Not Stated] |
|---|---|
| 2nd row | Florida |
| 3rd row | Aeshnidae |
| Value | Count | Frequency (%) |
| not | 1 | |
| stated | 1 | |
| florida | 1 | |
| aeshnidae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3 | 10.7% |
| a | 3 | 10.7% |
| e | 3 | 10.7% |
| d | 3 | 10.7% |
| o | 2 | 7.1% |
| i | 2 | 7.1% |
| [ | 1 | 3.6% |
| r | 1 | 3.6% |
| h | 1 | 3.6% |
| s | 1 | 3.6% |
| Other values (8) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21 | |
| Uppercase Letter | 4 | 14.3% |
| Open Punctuation | 1 | 3.6% |
| Close Punctuation | 1 | 3.6% |
| Space Separator | 1 | 3.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3 | |
| a | 3 | |
| e | 3 | |
| d | 3 | |
| o | 2 | |
| i | 2 | |
| r | 1 | 4.8% |
| h | 1 | 4.8% |
| s | 1 | 4.8% |
| l | 1 | 4.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 | |
| F | 1 | |
| N | 1 | |
| S | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25 | |
| Common | 3 | 10.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3 | |
| a | 3 | |
| e | 3 | |
| d | 3 | |
| o | 2 | 8.0% |
| i | 2 | 8.0% |
| r | 1 | 4.0% |
| h | 1 | 4.0% |
| s | 1 | 4.0% |
| A | 1 | 4.0% |
| Other values (5) | 5 |
Common
| Value | Count | Frequency (%) |
| [ | 1 | |
| ] | 1 | |
| (space) | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 3 | 10.7% |
| a | 3 | 10.7% |
| e | 3 | 10.7% |
| d | 3 | 10.7% |
| o | 2 | 7.1% |
| i | 2 | 7.1% |
| [ | 1 | 3.6% |
| r | 1 | 3.6% |
| h | 1 | 3.6% |
| s | 1 | 3.6% |
| Other values (8) | 8 |
latestEpochOrHighestSeries
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Pinellas |
|---|---|
| Value | Count | Frequency (%) |
| pinellas | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 2 | |
| P | 1 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 | |
| s | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 | |
| s | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 2 | |
| P | 1 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 2 | |
| P | 1 | |
| i | 1 | |
| n | 1 | |
| e | 1 | |
| a | 1 | |
| s | 1 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604717 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 14 |
| Mean length | 19 |
| Min length | 12 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | [Not Stated] |
|---|---|
| 2nd row | St. Petersburg |
| 3rd row | Huaru Valley, 90 mi. N. of Lima |
| Value | Count | Frequency (%) |
| not | 1 | |
| stated | 1 | |
| st | 1 | |
| petersburg | 1 | |
| huaru | 1 | |
| valley | 1 | |
| 90 | 1 | |
| mi | 1 | |
| n | 1 | |
| of | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 8 | 14.0% |
| t | 5 | 8.8% |
| a | 4 | 7.0% |
| e | 4 | 7.0% |
| u | 3 | 5.3% |
| . | 3 | 5.3% |
| r | 3 | 5.3% |
| i | 2 | 3.5% |
| o | 2 | 3.5% |
| S | 2 | 3.5% |
| Other values (18) | 21 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33 | |
| Space Separator | 8 | 14.0% |
| Uppercase Letter | 8 | 14.0% |
| Other Punctuation | 4 | 7.0% |
| Decimal Number | 2 | 3.5% |
| Open Punctuation | 1 | 1.8% |
| Close Punctuation | 1 | 1.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 5 | |
| a | 4 | |
| e | 4 | |
| u | 3 | |
| r | 3 | |
| i | 2 | 6.1% |
| o | 2 | 6.1% |
| l | 2 | 6.1% |
| m | 2 | 6.1% |
| f | 1 | 3.0% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2 | |
| N | 2 | |
| V | 1 | |
| H | 1 | |
| P | 1 | |
| L | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| , | 1 | 25.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 0 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41 | |
| Common | 16 | 28.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 5 | |
| a | 4 | 9.8% |
| e | 4 | 9.8% |
| u | 3 | 7.3% |
| r | 3 | 7.3% |
| i | 2 | 4.9% |
| o | 2 | 4.9% |
| S | 2 | 4.9% |
| l | 2 | 4.9% |
| m | 2 | 4.9% |
| Other values (11) | 12 |
Common
| Value | Count | Frequency (%) |
| (space) | 8 | |
| . | 3 | 18.8% |
| , | 1 | 6.2% |
| 9 | 1 | 6.2% |
| [ | 1 | 6.2% |
| 0 | 1 | 6.2% |
| ] | 1 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 8 | 14.0% |
| t | 5 | 8.8% |
| a | 4 | 7.0% |
| e | 4 | 7.0% |
| u | 3 | 5.3% |
| . | 3 | 5.3% |
| r | 3 | 5.3% |
| i | 2 | 3.5% |
| o | 2 | 3.5% |
| S | 2 | 3.5% |
| Other values (18) | 21 |
lowestBiostratigraphicZone
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Gynacantha |
|---|---|
| Value | Count | Frequency (%) |
| gynacantha | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 2 | |
| G | 1 | 10.0% |
| y | 1 | 10.0% |
| c | 1 | 10.0% |
| t | 1 | 10.0% |
| h | 1 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 | |
| Uppercase Letter | 1 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 2 | |
| y | 1 | 11.1% |
| c | 1 | 11.1% |
| t | 1 | 11.1% |
| h | 1 | 11.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 2 | |
| G | 1 | 10.0% |
| y | 1 | 10.0% |
| c | 1 | 10.0% |
| t | 1 | 10.0% |
| h | 1 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 2 | |
| G | 1 | 10.0% |
| y | 1 | 10.0% |
| c | 1 | 10.0% |
| t | 1 | 10.0% |
| h | 1 | 10.0% |
formation
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | membranalis |
|---|---|
| Value | Count | Frequency (%) |
| membranalis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 2 | |
| a | 2 | |
| e | 1 | |
| b | 1 | |
| r | 1 | |
| n | 1 | |
| l | 1 | |
| i | 1 | |
| s | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 2 | |
| a | 2 | |
| e | 1 | |
| b | 1 | |
| r | 1 | |
| n | 1 | |
| l | 1 | |
| i | 1 | |
| s | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 2 | |
| a | 2 | |
| e | 1 | |
| b | 1 | |
| r | 1 | |
| n | 1 | |
| l | 1 | |
| i | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| m | 2 | |
| a | 2 | |
| e | 1 | |
| b | 1 | |
| r | 1 | |
| n | 1 | |
| l | 1 | |
| i | 1 | |
| s | 1 |
identificationQualifier
Text
Missing 
| Distinct | 16 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 603282 |
| Missing (%) | 99.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 5.812934631 |
| Min length | 2 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | near |
|---|---|
| 2nd row | uncertain |
| 3rd row | near |
| 4th row | near |
| 5th row | cf. |
| Value | Count | Frequency (%) |
| near | 466 | |
| uncertain | 459 | |
| cf | 238 | |
| group | 113 | 7.7% |
| subgroup | 80 | 5.4% |
| complex | 26 | 1.8% |
| aff | 21 | 1.4% |
| sp | 21 | 1.4% |
| n | 15 | 1.0% |
| sensu | 11 | 0.7% |
| Other values (6) | 24 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1418 | |
| r | 1132 | |
| e | 962 | |
| a | 948 | |
| u | 743 | |
| c | 733 | |
| t | 481 | 5.8% |
| i | 471 | 5.6% |
| f | 280 | 3.3% |
| p | 240 | 2.9% |
| Other values (14) | 951 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8138 | |
| Other Punctuation | 180 | 2.2% |
| Space Separator | 36 | 0.4% |
| Uppercase Letter | 5 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1418 | |
| r | 1132 | |
| e | 962 | |
| a | 948 | |
| u | 743 | |
| c | 733 | |
| t | 481 | 5.9% |
| i | 471 | 5.8% |
| f | 280 | 3.4% |
| p | 240 | 2.9% |
| Other values (9) | 730 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| B | 2 | |
| K | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 180 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 36 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8143 | |
| Common | 216 | 2.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1418 | |
| r | 1132 | |
| e | 962 | |
| a | 948 | |
| u | 743 | |
| c | 733 | |
| t | 481 | 5.9% |
| i | 471 | 5.8% |
| f | 280 | 3.4% |
| p | 240 | 2.9% |
| Other values (12) | 735 |
Common
| Value | Count | Frequency (%) |
| . | 180 | |
| (space) | 36 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8359 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 1418 | |
| r | 1132 | |
| e | 962 | |
| a | 948 | |
| u | 743 | |
| c | 733 | |
| t | 481 | 5.8% |
| i | 471 | 5.6% |
| f | 280 | 3.3% |
| p | 240 | 2.9% |
| Other values (14) | 951 |
typeStatus
Text
Missing 
| Distinct | 62 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 486142 |
| Missing (%) | 80.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 8 |
| Mean length | 7.058653376 |
| Min length | 1 |
Unique
| Unique | 20 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Paratype |
|---|---|
| 2nd row | Type |
| 3rd row | Holotype |
| 4th row | Type |
| 5th row | Primary Syntype |
| Value | Count | Frequency (%) |
| holotype | 54132 | |
| type | 32982 | |
| syntype | 13149 | 10.8% |
| paratype | 11029 | 9.0% |
| lectotype | 5242 | 4.3% |
| primary | 3223 | 2.6% |
| allotype | 1092 | 0.9% |
| syntypes | 429 | 0.4% |
| neotype | 316 | 0.3% |
| cotype | 298 | 0.2% |
| Other values (14) | 175 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| y | 135631 | |
| e | 124524 | |
| p | 118840 | |
| o | 115382 | |
| t | 91216 | |
| l | 56450 | |
| H | 54135 | 6.5% |
| T | 32989 | 3.9% |
| a | 25558 | 3.1% |
| r | 17612 | 2.1% |
| Other values (16) | 64664 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 711090 | |
| Uppercase Letter | 122059 | 14.6% |
| Space Separator | 3489 | 0.4% |
| Other Punctuation | 363 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 135631 | |
| e | 124524 | |
| p | 118840 | |
| o | 115382 | |
| t | 91216 | |
| l | 56450 | |
| a | 25558 | 3.6% |
| r | 17612 | 2.5% |
| n | 13588 | 1.9% |
| c | 5368 | 0.8% |
| Other values (5) | 6921 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 54135 | |
| T | 32989 | |
| P | 14391 | 11.8% |
| S | 13578 | 11.1% |
| L | 5247 | 4.3% |
| A | 1094 | 0.9% |
| N | 322 | 0.3% |
| C | 303 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 254 | |
| ? | 109 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3489 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 833149 | |
| Common | 3852 | 0.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| y | 135631 | |
| e | 124524 | |
| p | 118840 | |
| o | 115382 | |
| t | 91216 | |
| l | 56450 | |
| H | 54135 | 6.5% |
| T | 32989 | 4.0% |
| a | 25558 | 3.1% |
| r | 17612 | 2.1% |
| Other values (13) | 60812 |
Common
| Value | Count | Frequency (%) |
| (space) | 3489 | |
| ; | 254 | 6.6% |
| ? | 109 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 837001 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| y | 135631 | |
| e | 124524 | |
| p | 118840 | |
| o | 115382 | |
| t | 91216 | |
| l | 56450 | |
| H | 54135 | 6.5% |
| T | 32989 | 3.9% |
| a | 25558 | 3.1% |
| r | 17612 | 2.1% |
| Other values (16) | 64664 |
identifiedBy
Text
Missing 
| Distinct | 2736 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 455024 |
| Missing (%) | 75.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 150 |
|---|---|
| Median length | 106 |
| Mean length | 27.7928268 |
| Min length | 2 |
Unique
| Unique | 933 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Westfall, M. J., Jr. |
|---|---|
| 2nd row | Donnelly, Thomas W. |
| 3rd row | Flint, Oliver S., Jr., (ENT), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 4th row | Kormann, K. |
| 5th row | DeMarmels |
| Value | Count | Frequency (%) |
| w | 28134 | 4.4% |
| united | 24412 | 3.8% |
| states | 24411 | 3.8% |
| 22738 | 3.5% | |
| of | 22001 | 3.4% |
| s | 21919 | 3.4% |
| smithsonian | 21911 | 3.4% |
| institution | 21911 | 3.4% |
| museum | 21368 | 3.3% |
| natural | 21090 | 3.3% |
| Other values (2399) | 413103 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 493302 | 11.9% |
| i | 251011 | 6.0% |
| o | 231967 | 5.6% |
| t | 230937 | 5.6% |
| n | 230507 | 5.5% |
| a | 200387 | 4.8% |
| , | 193571 | 4.7% |
| r | 182856 | 4.4% |
| . | 170385 | 4.1% |
| s | 166946 | 4.0% |
| Other values (61) | 1808606 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2295528 | |
| Uppercase Letter | 890542 | 21.4% |
| Space Separator | 493302 | 11.9% |
| Other Punctuation | 364806 | 8.8% |
| Close Punctuation | 46602 | 1.1% |
| Open Punctuation | 46602 | 1.1% |
| Dash Punctuation | 23093 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 251011 | |
| o | 231967 | |
| t | 230937 | |
| n | 230507 | |
| a | 200387 | |
| r | 182856 | |
| s | 166946 | |
| l | 162370 | |
| e | 157410 | |
| u | 114807 | 5.0% |
| Other values (23) | 366330 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 112744 | |
| S | 105400 | |
| N | 90473 | |
| E | 79714 | 9.0% |
| M | 58710 | 6.6% |
| D | 53045 | 6.0% |
| I | 47398 | 5.3% |
| A | 45391 | 5.1% |
| W | 36649 | 4.1% |
| J | 36232 | 4.1% |
| Other values (16) | 224786 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 193571 | |
| . | 170385 | |
| & | 690 | 0.2% |
| ' | 157 | < 0.1% |
| ; | 2 | < 0.1% |
| ? | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 46600 | |
| ] | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 46600 | |
| [ | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 493302 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23093 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3186070 | |
| Common | 974405 | 23.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 251011 | 7.9% |
| o | 231967 | 7.3% |
| t | 230937 | 7.2% |
| n | 230507 | 7.2% |
| a | 200387 | 6.3% |
| r | 182856 | 5.7% |
| s | 166946 | 5.2% |
| l | 162370 | 5.1% |
| e | 157410 | 4.9% |
| u | 114807 | 3.6% |
| Other values (49) | 1256872 |
Common
| Value | Count | Frequency (%) |
| (space) | 493302 | |
| , | 193571 | 19.9% |
| . | 170385 | 17.5% |
| ) | 46600 | 4.8% |
| ( | 46600 | 4.8% |
| - | 23093 | 2.4% |
| & | 690 | 0.1% |
| ' | 157 | < 0.1% |
| [ | 2 | < 0.1% |
| ] | 2 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4160438 | |
| None | 37 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 493302 | 11.9% |
| i | 251011 | 6.0% |
| o | 231967 | 5.6% |
| t | 230937 | 5.6% |
| n | 230507 | 5.5% |
| a | 200387 | 4.8% |
| , | 193571 | 4.7% |
| r | 182856 | 4.4% |
| . | 170385 | 4.1% |
| s | 166946 | 4.0% |
| Other values (54) | 1808569 |
None
| Value | Count | Frequency (%) |
| á | 9 | |
| ń | 9 | |
| ż | 9 | |
| ö | 7 | |
| ü | 1 | 2.7% |
| è | 1 | 2.7% |
| ä | 1 | 2.7% |
identifiedByID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604718 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7.5 |
| Mean length | 7.5 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 27.7731 |
|---|---|
| 2nd row | -4.55006 |
| Value | Count | Frequency (%) |
| 27.7731 | 1 | |
| 4.55006 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 3 | |
| . | 2 | |
| 5 | 2 | |
| 0 | 2 | |
| 2 | 1 | 6.7% |
| 3 | 1 | 6.7% |
| 1 | 1 | 6.7% |
| - | 1 | 6.7% |
| 4 | 1 | 6.7% |
| 6 | 1 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12 | |
| Other Punctuation | 2 | 13.3% |
| Dash Punctuation | 1 | 6.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 3 | |
| 5 | 2 | |
| 0 | 2 | |
| 2 | 1 | 8.3% |
| 3 | 1 | 8.3% |
| 1 | 1 | 8.3% |
| 4 | 1 | 8.3% |
| 6 | 1 | 8.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 3 | |
| . | 2 | |
| 5 | 2 | |
| 0 | 2 | |
| 2 | 1 | 6.7% |
| 3 | 1 | 6.7% |
| 1 | 1 | 6.7% |
| - | 1 | 6.7% |
| 4 | 1 | 6.7% |
| 6 | 1 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 3 | |
| . | 2 | |
| 5 | 2 | |
| 0 | 2 | |
| 2 | 1 | 6.7% |
| 3 | 1 | 6.7% |
| 1 | 1 | 6.7% |
| - | 1 | 6.7% |
| 4 | 1 | 6.7% |
| 6 | 1 | 6.7% |
dateIdentified
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604718 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 6 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -82.64 |
|---|---|
| 2nd row | -76.1874 |
| Value | Count | Frequency (%) |
| 82.64 | 1 | |
| 76.1874 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 2 | |
| 8 | 2 | |
| . | 2 | |
| 6 | 2 | |
| 4 | 2 | |
| 7 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10 | |
| Dash Punctuation | 2 | 14.3% |
| Other Punctuation | 2 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 6 | 2 | |
| 4 | 2 | |
| 7 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 2 | |
| 8 | 2 | |
| . | 2 | |
| 6 | 2 | |
| 4 | 2 | |
| 7 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 2 | |
| 8 | 2 | |
| . | 2 | |
| 6 | 2 | |
| 4 | 2 | |
| 7 | 2 | |
| 2 | 1 | |
| 1 | 1 |
identificationReferences
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 18 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | WGS 84 (EPSG:4326) |
|---|---|
| Value | Count | Frequency (%) |
| wgs | 1 | |
| 84 | 1 | |
| epsg:4326 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 2 | |
| S | 2 | |
| (space) | 2 | |
| 4 | 2 | |
| W | 1 | 5.6% |
| 8 | 1 | 5.6% |
| ( | 1 | 5.6% |
| E | 1 | 5.6% |
| P | 1 | 5.6% |
| : | 1 | 5.6% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7 | |
| Decimal Number | 6 | |
| Space Separator | 2 | 11.1% |
| Open Punctuation | 1 | 5.6% |
| Other Punctuation | 1 | 5.6% |
| Close Punctuation | 1 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 2 | |
| S | 2 | |
| W | 1 | |
| E | 1 | |
| P | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 8 | 1 | |
| 3 | 1 | |
| 2 | 1 | |
| 6 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11 | |
| Latin | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 2 | |
| 4 | 2 | |
| 8 | 1 | |
| ( | 1 | |
| : | 1 | |
| 3 | 1 | |
| 2 | 1 | |
| 6 | 1 | |
| ) | 1 |
Latin
| Value | Count | Frequency (%) |
| G | 2 | |
| S | 2 | |
| W | 1 | |
| E | 1 | |
| P | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 2 | |
| S | 2 | |
| (space) | 2 | |
| 4 | 2 | |
| W | 1 | 5.6% |
| 8 | 1 | 5.6% |
| ( | 1 | 5.6% |
| E | 1 | 5.6% |
| P | 1 | 5.6% |
| : | 1 | 5.6% |
| Other values (4) | 4 |
scientificName
Text
| Distinct | 245072 |
|---|---|
| Distinct (%) | 40.8% |
| Missing | 4631 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 68 |
|---|---|
| Median length | 61 |
| Mean length | 20.77041739 |
| Min length | 3 |
Unique
| Unique | 201386 |
|---|---|
| Unique (%) | 33.6% |
Sample
| 1st row | Camponotus (Myrmosericus) rufoglaucus cinctella var. rufigenis |
|---|---|
| 2nd row | Athrips mesoleuca |
| 3rd row | Paranthrene asilipennis |
| 4th row | Acanthagrion trilobatum |
| 5th row | Calathus nanulus |
| Value | Count | Frequency (%) |
| bombus | 69597 | 5.3% |
| sp | 44400 | 3.4% |
| pyrobombus | 21249 | 1.6% |
| xylocopa | 12224 | 0.9% |
| unidentified | 9030 | 0.7% |
| argia | 8665 | 0.7% |
| apis | 8603 | 0.6% |
| enallagma | 7977 | 0.6% |
| crambus | 7970 | 0.6% |
| ischnura | 7458 | 0.6% |
| Other values (130820) | 1127419 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1254120 | 10.1% |
| i | 1043361 | 8.4% |
| s | 971373 | 7.8% |
| o | 842893 | 6.8% |
| e | 820899 | 6.6% |
| (space) | 724503 | 5.8% |
| r | 712805 | 5.7% |
| l | 623128 | 5.0% |
| u | 614998 | 4.9% |
| n | 589887 | 4.7% |
| Other values (71) | 4266132 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10815150 | |
| Space Separator | 724503 | 5.8% |
| Uppercase Letter | 692195 | 5.6% |
| Open Punctuation | 92284 | 0.7% |
| Close Punctuation | 92282 | 0.7% |
| Other Punctuation | 46370 | 0.4% |
| Decimal Number | 742 | < 0.1% |
| Connector Punctuation | 312 | < 0.1% |
| Dash Punctuation | 259 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1254120 | |
| i | 1043361 | 9.6% |
| s | 971373 | 9.0% |
| o | 842893 | 7.8% |
| e | 820899 | 7.6% |
| r | 712805 | 6.6% |
| l | 623128 | 5.8% |
| u | 614998 | 5.7% |
| n | 589887 | 5.5% |
| t | 542919 | 5.0% |
| Other values (18) | 2798767 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 97591 | |
| B | 85599 | |
| A | 75796 | |
| C | 69821 | |
| S | 43674 | 6.3% |
| E | 42648 | 6.2% |
| L | 33325 | 4.8% |
| M | 31766 | 4.6% |
| T | 31189 | 4.5% |
| H | 29108 | 4.2% |
| Other values (16) | 151678 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 216 | |
| 9 | 110 | |
| 0 | 93 | |
| 2 | 79 | 10.6% |
| 3 | 67 | 9.0% |
| 4 | 55 | 7.4% |
| 6 | 44 | 5.9% |
| 5 | 30 | 4.0% |
| 7 | 30 | 4.0% |
| 8 | 18 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 46204 | |
| ? | 109 | 0.2% |
| # | 34 | 0.1% |
| / | 14 | < 0.1% |
| , | 4 | < 0.1% |
| ; | 2 | < 0.1% |
| ' | 2 | < 0.1% |
| ! | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 92226 | |
| [ | 58 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 92224 | |
| ] | 58 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 724503 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 312 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 259 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11507345 | |
| Common | 956754 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1254120 | 10.9% |
| i | 1043361 | 9.1% |
| s | 971373 | 8.4% |
| o | 842893 | 7.3% |
| e | 820899 | 7.1% |
| r | 712805 | 6.2% |
| l | 623128 | 5.4% |
| u | 614998 | 5.3% |
| n | 589887 | 5.1% |
| t | 542919 | 4.7% |
| Other values (44) | 3490962 |
Common
| Value | Count | Frequency (%) |
| (space) | 724503 | |
| ( | 92226 | 9.6% |
| ) | 92224 | 9.6% |
| . | 46204 | 4.8% |
| _ | 312 | < 0.1% |
| - | 259 | < 0.1% |
| 1 | 216 | < 0.1% |
| 9 | 110 | < 0.1% |
| ? | 109 | < 0.1% |
| 0 | 93 | < 0.1% |
| Other values (17) | 498 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12464077 | |
| None | 21 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1254120 | 10.1% |
| i | 1043361 | 8.4% |
| s | 971373 | 7.8% |
| o | 842893 | 6.8% |
| e | 820899 | 6.6% |
| (space) | 724503 | 5.8% |
| r | 712805 | 5.7% |
| l | 623128 | 5.0% |
| u | 614998 | 4.9% |
| n | 589887 | 4.7% |
| Other values (68) | 4266110 |
None
| Value | Count | Frequency (%) |
| ö | 19 | |
| ñ | 2 | 9.5% |
Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
originalNameUsage
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 604719 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Google Earth |
|---|---|
| Value | Count | Frequency (%) |
| google | 1 | |
| earth | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2 | |
| G | 1 | |
| g | 1 | |
| l | 1 | |
| e | 1 | |
| (space) | 1 | |
| E | 1 | |
| a | 1 | |
| r | 1 | |
| t | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 | |
| Uppercase Letter | 2 | 16.7% |
| Space Separator | 1 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2 | |
| g | 1 | |
| l | 1 | |
| e | 1 | |
| a | 1 | |
| r | 1 | |
| t | 1 | |
| h | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| E | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11 | |
| Common | 1 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2 | |
| G | 1 | |
| g | 1 | |
| l | 1 | |
| e | 1 | |
| E | 1 | |
| a | 1 | |
| r | 1 | |
| t | 1 | |
| h | 1 |
Common
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2 | |
| G | 1 | |
| g | 1 | |
| l | 1 | |
| e | 1 | |
| (space) | 1 | |
| E | 1 | |
| a | 1 | |
| r | 1 | |
| t | 1 |
higherClassification
Text
| Distinct | 3454 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 4650 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 97 |
|---|---|
| Median length | 91 |
| Mean length | 62.39120769 |
| Min length | 9 |
Unique
| Unique | 574 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Animalia, Arthropoda, Insecta, Hymenoptera, Formicidae, Formicinae |
|---|---|
| 2nd row | Animalia, Arthropoda, Insecta, Lepidoptera, Gelechiidae, Gelechiinae |
| 3rd row | Animalia, Arthropoda, Insecta, Lepidoptera, Sesiidae, Sesiinae |
| 4th row | Animalia, Arthropoda, Insecta, Odonata, Zygoptera, Coenagrionidae |
| 5th row | Animalia, Arthropoda, Insecta, Coleoptera, Carabidae |
| Value | Count | Frequency (%) |
| arthropoda | 599790 | |
| animalia | 598420 | |
| insecta | 588007 | |
| hymenoptera | 146523 | 4.2% |
| odonata | 117300 | 3.4% |
| lepidoptera | 99955 | 2.9% |
| apidae | 82945 | 2.4% |
| diptera | 73546 | 2.1% |
| coleoptera | 72087 | 2.1% |
| apinae | 63529 | 1.8% |
| Other values (2936) | 1026199 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4571468 | |
| e | 2938748 | 7.8% |
| (space) | 2868231 | 7.7% |
| , | 2867865 | 7.7% |
| i | 2865509 | 7.7% |
| o | 2433279 | 6.5% |
| r | 2317205 | 6.2% |
| t | 2192393 | 5.9% |
| n | 2160394 | 5.8% |
| p | 1690401 | 4.5% |
| Other values (51) | 10533599 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28235044 | |
| Uppercase Letter | 3467869 | 9.3% |
| Space Separator | 2868231 | 7.7% |
| Other Punctuation | 2867934 | 7.7% |
| Decimal Number | 10 | < 0.1% |
| Connector Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4571468 | |
| e | 2938748 | |
| i | 2865509 | |
| o | 2433279 | |
| r | 2317205 | |
| t | 2192393 | |
| n | 2160394 | |
| p | 1690401 | 6.0% |
| d | 1537978 | 5.4% |
| l | 1128105 | 4.0% |
| Other values (16) | 4399564 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1474246 | |
| I | 598269 | |
| C | 245242 | 7.1% |
| H | 231754 | 6.7% |
| L | 182551 | 5.3% |
| O | 125511 | 3.6% |
| P | 113924 | 3.3% |
| D | 95383 | 2.8% |
| S | 80630 | 2.3% |
| Z | 57610 | 1.7% |
| Other values (15) | 262749 | 7.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 3 | |
| 0 | 2 | |
| 1 | 2 | |
| 3 | 2 | |
| 9 | 1 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2867865 | |
| ? | 39 | < 0.1% |
| / | 30 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2868231 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31702913 | |
| Common | 5736179 | 15.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4571468 | |
| e | 2938748 | 9.3% |
| i | 2865509 | 9.0% |
| o | 2433279 | 7.7% |
| r | 2317205 | 7.3% |
| t | 2192393 | 6.9% |
| n | 2160394 | 6.8% |
| p | 1690401 | 5.3% |
| d | 1537978 | 4.9% |
| A | 1474246 | 4.7% |
| Other values (41) | 7521292 |
Common
| Value | Count | Frequency (%) |
| (space) | 2868231 | |
| , | 2867865 | |
| ? | 39 | < 0.1% |
| / | 30 | < 0.1% |
| _ | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| 0 | 2 | < 0.1% |
| 1 | 2 | < 0.1% |
| 3 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37439092 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4571468 | |
| e | 2938748 | 7.8% |
| (space) | 2868231 | 7.7% |
| , | 2867865 | 7.7% |
| i | 2865509 | 7.7% |
| o | 2433279 | 6.5% |
| r | 2317205 | 6.2% |
| t | 2192393 | 5.9% |
| n | 2160394 | 5.8% |
| p | 1690401 | 4.5% |
| Other values (51) | 10533599 |
kingdom
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6300 |
| Missing (%) | 1.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 598420 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1196840 | |
| a | 1196840 | |
| A | 598420 | |
| n | 598420 | |
| m | 598420 | |
| l | 598420 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4188940 | |
| Uppercase Letter | 598420 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1196840 | |
| a | 1196840 | |
| n | 598420 | |
| m | 598420 | |
| l | 598420 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 598420 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4787360 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1196840 | |
| a | 1196840 | |
| A | 598420 | |
| n | 598420 | |
| m | 598420 | |
| l | 598420 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4787360 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1196840 | |
| a | 1196840 | |
| A | 598420 | |
| n | 598420 | |
| m | 598420 | |
| l | 598420 |
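The character and category counts in the kingdom tables above can be reproduced from the single value and its count. The following is a minimal sketch, not the profiler's actual implementation: the helper name `char_stats` is hypothetical, and it assumes the categories shown are Unicode general categories as returned by Python's `unicodedata.category` ("Ll" for Lowercase Letter, "Lu" for Uppercase Letter).

```python
import unicodedata
from collections import Counter


def char_stats(value_counts):
    """Aggregate per-character and per-category counts.

    value_counts maps each distinct string to the number of rows holding it.
    (Hypothetical helper, written for illustration only.)
    """
    chars = Counter()
    categories = Counter()
    for value, n in value_counts.items():
        for ch in value:
            chars[ch] += n
            # General category code, e.g. "Ll" = lowercase, "Lu" = uppercase
            categories[unicodedata.category(ch)] += n
    return chars, categories


# The kingdom column: one distinct value in 598420 non-missing rows.
chars, categories = char_stats({"Animalia": 598420})

print(chars["i"], chars["a"], chars["A"])   # 1196840 1196840 598420
print(categories["Ll"], categories["Lu"])   # 4188940 598420
print(sum(chars.values()))                  # 4787360
```

These totals match the Lowercase Letter, Uppercase Letter, and ASCII rows reported for kingdom.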
phylum
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4930 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Arthropoda |
|---|---|
| 2nd row | Arthropoda |
| 3rd row | Arthropoda |
| 4th row | Arthropoda |
| 5th row | Arthropoda |
| Value | Count | Frequency (%) |
| arthropoda | 599790 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 1199580 | |
| o | 1199580 | |
| a | 599826 | |
| t | 599790 | |
| h | 599790 | |
| p | 599790 | |
| d | 599790 | |
| A | 599754 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5398146 | |
| Uppercase Letter | 599754 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 1199580 | |
| o | 1199580 | |
| a | 599826 | |
| t | 599790 | |
| h | 599790 | |
| p | 599790 | |
| d | 599790 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 599754 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5997900 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 1199580 | |
| o | 1199580 | |
| a | 599826 | |
| t | 599790 | |
| h | 599790 | |
| p | 599790 | |
| d | 599790 | |
| A | 599754 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5997900 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 1199580 | |
| o | 1199580 | |
| a | 599826 | |
| t | 599790 | |
| h | 599790 | |
| p | 599790 | |
| d | 599790 | |
| A | 599754 |
class
Text
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5496 |
| Missing (%) | 0.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 7 |
| Mean length | 7.038307878 |
| Min length | 7 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Insecta |
|---|---|
| 2nd row | Insecta |
| 3rd row | Insecta |
| 4th row | Insecta |
| 5th row | Insecta |
| Value | Count | Frequency (%) |
| insecta | 588007 | |
| arachnida | 7908 | 1.3% |
| diplopoda | 1604 | 0.3% |
| collembola | 798 | 0.1% |
| chilopoda | 740 | 0.1% |
| diplura | 76 | < 0.1% |
| protura | 62 | < 0.1% |
| symphyla | 8 | < 0.1% |
| myriapoda | 6 | < 0.1% |
| onychophora | 6 | < 0.1% |
| Other values (3) | 9 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 607141 | |
| n | 595933 | |
| c | 595923 | |
| e | 588805 | |
| t | 588070 | |
| s | 588008 | |
| I | 588007 | |
| i | 10334 | 0.2% |
| d | 10262 | 0.2% |
| h | 8668 | 0.2% |
| Other values (18) | 36372 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3618299 | |
| Uppercase Letter | 599224 | 14.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 607141 | |
| n | 595933 | |
| c | 595923 | |
| e | 588805 | |
| t | 588070 | |
| s | 588008 | |
| i | 10334 | 0.3% |
| d | 10262 | 0.3% |
| h | 8668 | 0.2% |
| r | 8125 | 0.2% |
| Other values (9) | 17030 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 588007 | |
| A | 7908 | 1.3% |
| D | 1680 | 0.3% |
| C | 1538 | 0.3% |
| P | 66 | < 0.1% |
| S | 8 | < 0.1% |
| M | 7 | < 0.1% |
| O | 6 | < 0.1% |
| U | 4 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4217523 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 607141 | |
| n | 595933 | |
| c | 595923 | |
| e | 588805 | |
| t | 588070 | |
| s | 588008 | |
| I | 588007 | |
| i | 10334 | 0.2% |
| d | 10262 | 0.2% |
| h | 8668 | 0.2% |
| Other values (18) | 36372 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4217523 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 607141 | |
| n | 595933 | |
| c | 595923 | |
| e | 588805 | |
| t | 588070 | |
| s | 588008 | |
| I | 588007 | |
| i | 10334 | 0.2% |
| d | 10262 | 0.2% |
| h | 8668 | 0.2% |
| Other values (18) | 36372 | 0.9% |
order
Text
| Distinct | 85 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4816 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 9.460972089 |
| Min length | 5 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Hymenoptera |
|---|---|
| 2nd row | Lepidoptera |
| 3rd row | Lepidoptera |
| 4th row | Odonata |
| 5th row | Coleoptera |
| Value | Count | Frequency (%) |
| hymenoptera | 146434 | |
| odonata | 117300 | |
| lepidoptera | 99929 | |
| diptera | 73541 | |
| coleoptera | 72075 | |
| hemiptera | 37773 | 6.3% |
| siphonaptera | 10088 | 1.7% |
| trichoptera | 9110 | 1.5% |
| araneae | 4645 | 0.8% |
| thysanoptera | 4630 | 0.8% |
| Other values (73) | 24379 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 849639 | |
| a | 747964 | |
| t | 600150 | |
| p | 583317 | |
| o | 554705 | |
| r | 496497 | |
| n | 284722 | 5.0% |
| i | 241800 | 4.3% |
| d | 223179 | 3.9% |
| m | 190651 | 3.4% |
| Other values (33) | 903051 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5075771 | |
| Uppercase Letter | 599902 | 10.6% |
| Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 849639 | |
| a | 747964 | |
| t | 600150 | |
| p | 583317 | |
| o | 554705 | |
| r | 496497 | |
| n | 284722 | 5.6% |
| i | 241800 | 4.8% |
| d | 223179 | 4.4% |
| m | 190651 | 3.8% |
| Other values (13) | 303147 | 6.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 184208 | |
| O | 119067 | |
| L | 100249 | |
| D | 73712 | |
| C | 72231 | 12.0% |
| T | 13924 | 2.3% |
| S | 10955 | 1.8% |
| P | 8233 | 1.4% |
| A | 5471 | 0.9% |
| M | 4855 | 0.8% |
| Other values (9) | 6997 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5675673 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 849639 | |
| a | 747964 | |
| t | 600150 | |
| p | 583317 | |
| o | 554705 | |
| r | 496497 | |
| n | 284722 | 5.0% |
| i | 241800 | 4.3% |
| d | 223179 | 3.9% |
| m | 190651 | 3.4% |
| Other values (32) | 903049 |
Common
| Value | Count | Frequency (%) |
| ? | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5675675 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 849639 | |
| a | 747964 | |
| t | 600150 | |
| p | 583317 | |
| o | 554705 | |
| r | 496497 | |
| n | 284722 | 5.0% |
| i | 241800 | 4.3% |
| d | 223179 | 3.9% |
| m | 190651 | 3.4% |
| Other values (33) | 903051 |
family
Text
| Distinct | 1481 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 4937 |
| Missing (%) | 0.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 10.51244367 |
| Min length | 3 |
Unique
| Unique | 207 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Formicidae |
|---|---|
| 2nd row | Gelechiidae |
| 3rd row | Sesiidae |
| 4th row | Coenagrionidae |
| 5th row | Carabidae |
| Value | Count | Frequency (%) |
| apidae | 82945 | 13.8% |
| libellulidae | 42510 | 7.1% |
| coenagrionidae | 35189 | 5.9% |
| chrysomelidae | 17542 | 2.9% |
| asilidae | 13404 | 2.2% |
| geometridae | 12783 | 2.1% |
| crambidae | 12086 | 2.0% |
| curculionidae | 12016 | 2.0% |
| psychodidae | 11788 | 2.0% |
| formicidae | 9927 | 1.7% |
| Other values (1470) | 349958 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 913350 | |
| e | 888633 | |
| a | 818158 | |
| d | 670599 | |
| o | 326213 | 5.2% |
| l | 321141 | 5.1% |
| r | 288681 | 4.6% |
| p | 212863 | 3.4% |
| n | 209521 | 3.3% |
| h | 149416 | 2.4% |
| Other values (49) | 1506610 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5705019 | |
| Uppercase Letter | 599782 | 9.5% |
| Space Separator | 365 | < 0.1% |
| Decimal Number | 10 | < 0.1% |
| Other Punctuation | 5 | < 0.1% |
| Connector Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 913350 | |
| e | 888633 | |
| a | 818158 | |
| d | 670599 | |
| o | 326213 | 5.7% |
| l | 321141 | 5.6% |
| r | 288681 | 5.1% |
| p | 212863 | 3.7% |
| n | 209521 | 3.7% |
| h | 149416 | 2.6% |
| Other values (16) | 906444 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 135683 | |
| A | 124194 | |
| L | 64733 | |
| P | 62029 | |
| S | 32302 | 5.4% |
| T | 31841 | 5.3% |
| G | 26661 | 4.4% |
| M | 18029 | 3.0% |
| N | 17189 | 2.9% |
| E | 13602 | 2.3% |
| Other values (15) | 73519 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 3 | |
| 0 | 2 | |
| 1 | 2 | |
| 3 | 2 | |
| 9 | 1 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 365 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 5 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6304801 | |
| Common | 384 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 913350 | |
| e | 888633 | |
| a | 818158 | |
| d | 670599 | |
| o | 326213 | 5.2% |
| l | 321141 | 5.1% |
| r | 288681 | 4.6% |
| p | 212863 | 3.4% |
| n | 209521 | 3.3% |
| h | 149416 | 2.4% |
| Other values (41) | 1506226 |
Common
| Value | Count | Frequency (%) |
| (space) | 365 | |
| ? | 5 | 1.3% |
| _ | 4 | 1.0% |
| 6 | 3 | 0.8% |
| 0 | 2 | 0.5% |
| 1 | 2 | 0.5% |
| 3 | 2 | 0.5% |
| 9 | 1 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6305185 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 913350 | |
| e | 888633 | |
| a | 818158 | |
| d | 670599 | |
| o | 326213 | 5.2% |
| l | 321141 | 5.1% |
| r | 288681 | 4.6% |
| p | 212863 | 3.4% |
| n | 209521 | 3.3% |
| h | 149416 | 2.4% |
| Other values (49) | 1506610 |
genus
Text
| Distinct | 39740 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 5432 |
| Missing (%) | 0.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 21 |
| Mean length | 8.981117593 |
| Min length | 1 |
Unique
| Unique | 14840 |
|---|---|
| Unique (%) | 2.5% |
Sample
| 1st row | Camponotus |
|---|---|
| 2nd row | Athrips |
| 3rd row | Paranthrene |
| 4th row | Acanthagrion |
| 5th row | Calathus |
| Value | Count | Frequency (%) |
| bombus | 62372 | 10.4% |
| xylocopa | 12105 | 2.0% |
| unidentified | 8808 | 1.5% |
| argia | 8662 | 1.4% |
| enallagma | 7977 | 1.3% |
| crambus | 7970 | 1.3% |
| ischnura | 7458 | 1.2% |
| sympetrum | 6028 | 1.0% |
| apis | 4969 | 0.8% |
| lestes | 4236 | 0.7% |
| Other values (39686) | 468802 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 530794 | 9.9% |
| o | 471993 | 8.8% |
| i | 398632 | 7.4% |
| s | 398294 | 7.4% |
| e | 380922 | 7.1% |
| r | 324165 | 6.0% |
| l | 256048 | 4.8% |
| u | 248374 | 4.6% |
| t | 243058 | 4.5% |
| m | 234489 | 4.4% |
| Other values (62) | 1895507 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4782652 | |
| Uppercase Letter | 599244 | 11.1% |
| Space Separator | 99 | < 0.1% |
| Open Punctuation | 77 | < 0.1% |
| Close Punctuation | 77 | < 0.1% |
| Other Punctuation | 68 | < 0.1% |
| Decimal Number | 31 | < 0.1% |
| Connector Punctuation | 23 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 530794 | |
| o | 471993 | 9.9% |
| i | 398632 | 8.3% |
| s | 398294 | 8.3% |
| e | 380922 | 8.0% |
| r | 324165 | 6.8% |
| l | 256048 | 5.4% |
| u | 248374 | 5.2% |
| t | 243058 | 5.1% |
| m | 234489 | 4.9% |
| Other values (17) | 1295883 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 76921 | |
| P | 69062 | |
| A | 66431 | |
| C | 64157 | |
| E | 40523 | 6.8% |
| S | 37144 | 6.2% |
| L | 31252 | 5.2% |
| T | 27870 | 4.7% |
| H | 27834 | 4.6% |
| M | 26250 | 4.4% |
| Other values (16) | 131800 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10 | |
| 1 | 7 | |
| 3 | 5 | |
| 2 | 3 | 9.7% |
| 6 | 3 | 9.7% |
| 4 | 2 | 6.5% |
| 9 | 1 | 3.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 48 | |
| . | 16 | 23.5% |
| / | 3 | 4.4% |
| ! | 1 | 1.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 55 | |
| ( | 22 | 28.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 55 | |
| ) | 22 | 28.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 99 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 23 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5381896 | |
| Common | 380 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 530794 | 9.9% |
| o | 471993 | 8.8% |
| i | 398632 | 7.4% |
| s | 398294 | 7.4% |
| e | 380922 | 7.1% |
| r | 324165 | 6.0% |
| l | 256048 | 4.8% |
| u | 248374 | 4.6% |
| t | 243058 | 4.5% |
| m | 234489 | 4.4% |
| Other values (43) | 1895127 |
Common
| Value | Count | Frequency (%) |
| (space) | 99 | |
| [ | 55 | |
| ] | 55 | |
| ? | 48 | |
| _ | 23 | 6.1% |
| ( | 22 | 5.8% |
| ) | 22 | 5.8% |
| . | 16 | 4.2% |
| 0 | 10 | 2.6% |
| 1 | 7 | 1.8% |
| Other values (9) | 23 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5382258 | |
| None | 17 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 530794 | 9.9% |
| o | 471993 | 8.8% |
| i | 398632 | 7.4% |
| s | 398294 | 7.4% |
| e | 380922 | 7.1% |
| r | 324165 | 6.0% |
| l | 256048 | 4.8% |
| u | 248374 | 4.6% |
| t | 243058 | 4.5% |
| m | 234489 | 4.4% |
| Other values (60) | 1895489 |
None
| Value | Count | Frequency (%) |
| ö | 17 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
subgenus
Text
Missing 
| Distinct | 3170 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 512525 |
| Missing (%) | 84.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 9.945918976 |
| Min length | 1 |
Unique
| Unique | 1134 |
|---|---|
| Unique (%) | 1.2% |
Sample
| 1st row | Myrmosericus |
|---|---|
| 2nd row | Anomalagrion |
| 3rd row | Anomalagrion |
| 4th row | Hypocaccus |
| 5th row | Bombus |
| Value | Count | Frequency (%) |
| pyrobombus | 21248 | |
| bombus | 7225 | 7.8% |
| apis | 3633 | 3.9% |
| fervidobombus | 3293 | 3.6% |
| neoxylocopa | 2426 | 2.6% |
| alpinobombus | 1554 | 1.7% |
| xylocopoides | 1492 | 1.6% |
| schonnherria | 1460 | 1.6% |
| separatobombus | 1325 | 1.4% |
| chimarra | 1296 | 1.4% |
| Other values (3159) | 47264 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 129387 | |
| s | 73941 | 8.1% |
| b | 73482 | 8.0% |
| r | 63107 | 6.9% |
| m | 58165 | 6.3% |
| u | 57827 | 6.3% |
| a | 57453 | 6.3% |
| i | 52141 | 5.7% |
| y | 39777 | 4.3% |
| e | 39462 | 4.3% |
| Other values (47) | 272222 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 824673 | |
| Uppercase Letter | 92195 | 10.1% |
| Other Punctuation | 74 | < 0.1% |
| Space Separator | 21 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 129387 | |
| s | 73941 | |
| b | 73482 | |
| r | 63107 | 7.7% |
| m | 58165 | 7.1% |
| u | 57827 | 7.0% |
| a | 57453 | 7.0% |
| i | 52141 | 6.3% |
| y | 39777 | 4.8% |
| e | 39462 | 4.8% |
| Other values (17) | 179931 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 28486 | |
| A | 9332 | 10.1% |
| B | 8660 | 9.4% |
| S | 6505 | 7.1% |
| M | 5495 | 6.0% |
| C | 5417 | 5.9% |
| N | 4425 | 4.8% |
| F | 4007 | 4.3% |
| T | 3288 | 3.6% |
| D | 2690 | 2.9% |
| Other values (16) | 13890 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 70 | |
| ? | 4 | 5.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 21 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 916868 | |
| Common | 96 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 129387 | |
| s | 73941 | 8.1% |
| b | 73482 | 8.0% |
| r | 63107 | 6.9% |
| m | 58165 | 6.3% |
| u | 57827 | 6.3% |
| a | 57453 | 6.3% |
| i | 52141 | 5.7% |
| y | 39777 | 4.3% |
| e | 39462 | 4.3% |
| Other values (43) | 272126 |
Common
| Value | Count | Frequency (%) |
| . | 70 | |
| (space) | 21 | 21.9% |
| ? | 4 | 4.2% |
| - | 1 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 916963 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 129387 | |
| s | 73941 | 8.1% |
| b | 73482 | 8.0% |
| r | 63107 | 6.9% |
| m | 58165 | 6.3% |
| u | 57827 | 6.3% |
| a | 57453 | 6.3% |
| i | 52141 | 5.7% |
| y | 39777 | 4.3% |
| e | 39462 | 4.3% |
| Other values (46) | 272221 |
None
| Value | Count | Frequency (%) |
| ö | 1 |
specificEpithet
Text
Missing 
| Distinct | 88940 |
|---|---|
| Distinct (%) | 14.9% |
| Missing | 8751 |
| Missing (%) | 1.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 25 |
| Mean length | 8.294070665 |
| Min length | 1 |
Unique
| Unique | 50119 |
|---|---|
| Unique (%) | 8.4% |
Sample
| 1st row | rufoglaucus |
|---|---|
| 2nd row | mesoleuca |
| 3rd row | asilipennis |
| 4th row | trilobatum |
| 5th row | nanulus |
| Value | Count | Frequency (%) |
| sp | 44400 | 7.4% |
| sylvicola | 6285 | 1.1% |
| bifarius | 4078 | 0.7% |
| kirbyellus | 3621 | 0.6% |
| flavifrons | 3483 | 0.6% |
| impatiens | 3134 | 0.5% |
| undetermined | 3047 | 0.5% |
| nevadensis | 2529 | 0.4% |
| cerana | 2431 | 0.4% |
| affinis | 2295 | 0.4% |
| Other values (88797) | 521298 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 624165 | |
| i | 556726 | |
| s | 471813 | 9.5% |
| e | 377708 | 7.6% |
| n | 324342 | 6.6% |
| l | 324007 | 6.6% |
| r | 301739 | 6.1% |
| u | 289537 | 5.9% |
| t | 260984 | 5.3% |
| c | 231687 | 4.7% |
| Other values (43) | 1180301 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4896540 | |
| Other Punctuation | 44585 | 0.9% |
| Decimal Number | 697 | < 0.1% |
| Space Separator | 632 | < 0.1% |
| Connector Punctuation | 289 | < 0.1% |
| Dash Punctuation | 249 | < 0.1% |
| Open Punctuation | 8 | < 0.1% |
| Close Punctuation | 8 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 624165 | |
| i | 556726 | |
| s | 471813 | |
| e | 377708 | 7.7% |
| n | 324342 | 6.6% |
| l | 324007 | 6.6% |
| r | 301739 | 6.2% |
| u | 289537 | 5.9% |
| t | 260984 | 5.3% |
| c | 231687 | 4.7% |
| Other values (18) | 1133832 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 205 | |
| 9 | 106 | |
| 0 | 83 | |
| 2 | 72 | 10.3% |
| 3 | 59 | 8.5% |
| 4 | 53 | 7.6% |
| 6 | 41 | 5.9% |
| 7 | 30 | 4.3% |
| 5 | 30 | 4.3% |
| 8 | 18 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 44494 | |
| ? | 43 | 0.1% |
| # | 34 | 0.1% |
| / | 9 | < 0.1% |
| ' | 2 | < 0.1% |
| ; | 2 | < 0.1% |
| , | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 | |
| [ | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 | |
| ] | 3 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 632 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 289 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 249 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4896540 | |
| Common | 46469 | 0.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 624165 | |
| i | 556726 | |
| s | 471813 | |
| e | 377708 | 7.7% |
| n | 324342 | 6.6% |
| l | 324007 | 6.6% |
| r | 301739 | 6.2% |
| u | 289537 | 5.9% |
| t | 260984 | 5.3% |
| c | 231687 | 4.7% |
| Other values (18) | 1133832 |
Common
| Value | Count | Frequency (%) |
| . | 44494 | |
| (space) | 632 | 1.4% |
| _ | 289 | 0.6% |
| - | 249 | 0.5% |
| 1 | 205 | 0.4% |
| 9 | 106 | 0.2% |
| 0 | 83 | 0.2% |
| 2 | 72 | 0.2% |
| 3 | 59 | 0.1% |
| 4 | 53 | 0.1% |
| Other values (15) | 227 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4943006 | |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 624165 | |
| i | 556726 | |
| s | 471813 | 9.5% |
| e | 377708 | 7.6% |
| n | 324342 | 6.6% |
| l | 324007 | 6.6% |
| r | 301739 | 6.1% |
| u | 289537 | 5.9% |
| t | 260984 | 5.3% |
| c | 231687 | 4.7% |
| Other values (41) | 1180298 |
None
| Value | Count | Frequency (%) |
| ñ | 2 | |
| ö | 1 |
Missing 
| Distinct | 8352 |
|---|---|
| Distinct (%) | 24.9% |
| Missing | 571231 |
| Missing (%) | 94.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 22 |
| Mean length | 8.8483084 |
| Min length | 1 |
Unique
| Unique | 5802 |
|---|---|
| Unique (%) | 17.3% |
Sample
| 1st row | rufigenis |
|---|---|
| 2nd row | decrescens |
| 3rd row | marianae |
| 4th row | neglectum |
| 5th row | lavatus |
| Value | Count | Frequency (%) |
| nearcticus | 2527 | 7.5% |
| fervidus | 1188 | 3.5% |
| violacea | 992 | 3.0% |
| pensylvanicus | 904 | 2.7% |
| vagans | 870 | 2.6% |
| portia | 724 | 2.2% |
| virginica | 593 | 1.8% |
| auricormus | 587 | 1.8% |
| auripennis | 578 | 1.7% |
| dorsata | 440 | 1.3% |
| Other values (8332) | 24136 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 38963 | |
| i | 34627 | |
| s | 26724 | |
| n | 23021 | 7.8% |
| r | 21553 | 7.3% |
| e | 21399 | 7.2% |
| u | 19000 | 6.4% |
| c | 18314 | 6.2% |
| t | 14569 | 4.9% |
| o | 13863 | 4.7% |
| Other values (28) | 64288 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 296210 | |
| Space Separator | 50 | < 0.1% |
| Other Punctuation | 39 | < 0.1% |
| Uppercase Letter | 6 | < 0.1% |
| Dash Punctuation | 5 | < 0.1% |
| Open Punctuation | 4 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
| Decimal Number | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 38963 | |
| i | 34627 | |
| s | 26724 | |
| n | 23021 | 7.8% |
| r | 21553 | 7.3% |
| e | 21399 | 7.2% |
| u | 19000 | 6.4% |
| c | 18314 | 6.2% |
| t | 14569 | 4.9% |
| o | 13863 | 4.7% |
| Other values (16) | 64177 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 23 | |
| ? | 14 | |
| / | 2 | 5.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 6 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 5 | |
| C | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 50 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 296216 | |
| Common | 105 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 38963 | |
| i | 34627 | |
| s | 26724 | |
| n | 23021 | 7.8% |
| r | 21553 | 7.3% |
| e | 21399 | 7.2% |
| u | 19000 | 6.4% |
| c | 18314 | 6.2% |
| t | 14569 | 4.9% |
| o | 13863 | 4.7% |
| Other values (18) | 64183 |
Common
| Value | Count | Frequency (%) |
| (space) | 50 | |
| . | 23 | |
| ? | 14 | 13.3% |
| - | 5 | 4.8% |
| ( | 4 | 3.8% |
| ) | 4 | 3.8% |
| / | 2 | 1.9% |
| 1 | 1 | 1.0% |
| 2 | 1 | 1.0% |
| 6 | 1 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 296321 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 38963 | |
| i | 34627 | |
| s | 26724 | |
| n | 23021 | 7.8% |
| r | 21553 | 7.3% |
| e | 21399 | 7.2% |
| u | 19000 | 6.4% |
| c | 18314 | 6.2% |
| t | 14569 | 4.9% |
| o | 13863 | 4.7% |
| Other values (28) | 64288 |
taxonRank
Text
Missing 
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 571236 |
| Missing (%) | 94.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 10 |
| Mean length | 9.835861904 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Variety |
|---|---|
| 2nd row | subspecies |
| 3rd row | subspecies |
| 4th row | subspecies |
| 5th row | subspecies |
| Value | Count | Frequency (%) |
| subspecies | 31600 | |
| variety | 1483 | 4.4% |
| aberration | 168 | 0.5% |
| form | 104 | 0.3% |
| race | 69 | 0.2% |
| morphotype | 28 | 0.1% |
| species | 10 | < 0.1% |
| group | 10 | < 0.1% |
| undet.cat | 9 | < 0.1% |
| var | 5 | < 0.1% |
| Other values (4) | 10 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 94823 | |
| e | 64977 | |
| i | 33273 | 10.1% |
| b | 31768 | 9.6% |
| p | 31681 | 9.6% |
| c | 31679 | 9.6% |
| u | 31610 | 9.6% |
| r | 1986 | 0.6% |
| a | 1745 | 0.5% |
| t | 1706 | 0.5% |
| Other values (20) | 4096 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 327473 | |
| Uppercase Letter | 1836 | 0.6% |
| Other Punctuation | 23 | < 0.1% |
| Space Separator | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 94823 | |
| e | 64977 | |
| i | 33273 | 10.2% |
| b | 31768 | 9.7% |
| p | 31681 | 9.7% |
| c | 31679 | 9.7% |
| u | 31610 | 9.7% |
| r | 1986 | 0.6% |
| a | 1745 | 0.5% |
| t | 1706 | 0.5% |
| Other values (10) | 2225 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 1470 | |
| A | 165 | 9.0% |
| F | 92 | 5.0% |
| R | 58 | 3.2% |
| M | 28 | 1.5% |
| U | 9 | 0.5% |
| C | 9 | 0.5% |
| S | 5 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 23 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 329309 | |
| Common | 35 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 94823 | |
| e | 64977 | |
| i | 33273 | 10.1% |
| b | 31768 | 9.6% |
| p | 31681 | 9.6% |
| c | 31679 | 9.6% |
| u | 31610 | 9.6% |
| r | 1986 | 0.6% |
| a | 1745 | 0.5% |
| t | 1706 | 0.5% |
| Other values (18) | 4061 | 1.2% |
Common
| Value | Count | Frequency (%) |
| . | 23 | |
| (space) | 12 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 329344 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 94823 | |
| e | 64977 | |
| i | 33273 | 10.1% |
| b | 31768 | 9.6% |
| p | 31681 | 9.6% |
| c | 31679 | 9.6% |
| u | 31610 | 9.6% |
| r | 1986 | 0.6% |
| a | 1745 | 0.5% |
| t | 1706 | 0.5% |
| Other values (20) | 4096 | 1.2% |
Missing 
| Distinct | 10001 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 90502 |
| Missing (%) | 15.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 33 |
| Mean length | 7.761809194 |
| Min length | 2 |
Unique
| Unique | 3229 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Forel |
|---|---|
| 2nd row | (Lower) |
| 3rd row | (Guérin-Méneville) |
| 4th row | Leonard |
| 5th row | Casey |
| Value | Count | Frequency (%) |
| 25801 | 4.4% | |
| hagen | 24579 | 4.1% |
| cresson | 22178 | 3.7% |
| selys | 21328 | 3.6% |
| casey | 19749 | 3.3% |
| say | 14238 | 2.4% |
| fabricius | 13983 | 2.4% |
| alexander | 9897 | 1.7% |
| smith | 9578 | 1.6% |
| kirby | 8910 | 1.5% |
| Other values (6005) | 422402 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 427913 | 10.7% |
| a | 306441 | 7.7% |
| r | 298135 | 7.5% |
| n | 241900 | 6.1% |
| s | 234839 | 5.9% |
| i | 207245 | 5.2% |
| l | 195525 | 4.9% |
| o | 172511 | 4.3% |
| ( | 140296 | 3.5% |
| ) | 140296 | 3.5% |
| Other values (73) | 1626161 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3036622 | |
| Uppercase Letter | 564533 | 14.1% |
| Open Punctuation | 140297 | 3.5% |
| Close Punctuation | 140297 | 3.5% |
| Space Separator | 78425 | 2.0% |
| Other Punctuation | 27987 | 0.7% |
| Dash Punctuation | 3101 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 427913 | |
| a | 306441 | |
| r | 298135 | |
| n | 241900 | 8.0% |
| s | 234839 | 7.7% |
| i | 207245 | 6.8% |
| l | 195525 | 6.4% |
| o | 172511 | 5.7% |
| t | 118977 | 3.9% |
| u | 111745 | 3.7% |
| Other values (36) | 721391 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 79152 | |
| S | 78874 | |
| H | 48773 | 8.6% |
| B | 48507 | 8.6% |
| M | 33646 | 6.0% |
| D | 32425 | 5.7% |
| F | 31311 | 5.5% |
| W | 28701 | 5.1% |
| L | 28670 | 5.1% |
| R | 25232 | 4.5% |
| Other values (17) | 129242 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 25801 | |
| . | 1801 | 6.4% |
| ' | 383 | 1.4% |
| , | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 140296 | |
| [ | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 140296 | |
| ] | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 78425 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3101 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3601155 | |
| Common | 390107 | 9.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 427913 | 11.9% |
| a | 306441 | 8.5% |
| r | 298135 | 8.3% |
| n | 241900 | 6.7% |
| s | 234839 | 6.5% |
| i | 207245 | 5.8% |
| l | 195525 | 5.4% |
| o | 172511 | 4.8% |
| t | 118977 | 3.3% |
| u | 111745 | 3.1% |
| Other values (63) | 1285924 |
Common
| Value | Count | Frequency (%) |
| ( | 140296 | |
| ) | 140296 | |
| (space) | 78425 | |
| & | 25801 | 6.6% |
| - | 3101 | 0.8% |
| . | 1801 | 0.5% |
| ' | 383 | 0.1% |
| , | 2 | < 0.1% |
| [ | 1 | < 0.1% |
| ] | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3984593 | |
| None | 6669 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 427913 | 10.7% |
| a | 306441 | 7.7% |
| r | 298135 | 7.5% |
| n | 241900 | 6.1% |
| s | 234839 | 5.9% |
| i | 207245 | 5.2% |
| l | 195525 | 4.9% |
| o | 172511 | 4.3% |
| ( | 140296 | 3.5% |
| ) | 140296 | 3.5% |
| Other values (52) | 1619492 |
None
| Value | Count | Frequency (%) |
| é | 2819 | |
| ü | 1605 | |
| ö | 1059 | 15.9% |
| á | 557 | 8.4% |
| ä | 442 | 6.6% |
| ã | 32 | 0.5% |
| ý | 22 | 0.3% |
| ó | 21 | 0.3% |
| ç | 19 | 0.3% |
| è | 17 | 0.3% |
| Other values (11) | 76 | 1.1% |
vernacularName
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 604718 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Type |
|---|---|
| 2nd row | Type |
| Value | Count | Frequency (%) |
| type | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 2 | |
| y | 2 | |
| p | 2 | |
| e | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 2 | 25.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 2 | |
| p | 2 | |
| e | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 2 | |
| y | 2 | |
| p | 2 | |
| e | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 2 | |
| y | 2 | |
| p | 2 | |
| e | 2 |
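The Distinct, Unique, and Missing figures reported for each variable can be sketched as follows. This is an illustrative reconstruction, not the profiler's code: the helper name `summarize` is hypothetical. It assumes that Distinct (%) is taken over non-missing values (consistent with the 50.0% shown for vernacularName, where 1 distinct value occurs in 2 non-missing rows) and that Unique counts values occurring exactly once.

```python
from collections import Counter


def summarize(values):
    """Summary statistics for one column; None marks a missing cell.

    (Hypothetical helper, written for illustration only.)
    """
    present = [v for v in values if v is not None]
    counts = Counter(present)
    n_total = len(values)
    n_present = len(present)
    return {
        "distinct": len(counts),
        # Assumption: percentage is relative to non-missing rows
        "distinct_pct": 100 * len(counts) / n_present if n_present else 0.0,
        "missing": n_total - n_present,
        "missing_pct": 100 * (n_total - n_present) / n_total if n_total else 0.0,
        # Values that occur exactly once
        "unique": sum(1 for c in counts.values() if c == 1),
    }


# vernacularName: 2 non-missing rows out of 604720 observations.
column = ["Type", "Type"] + [None] * 604718
stats = summarize(column)
print(stats["distinct"], stats["distinct_pct"], stats["missing"], stats["unique"])
```

With this input, the missing share is just under 100%, which the report renders as "> 99.9%".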